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ABSTRACT 


is 


y  To  a  multivariate  stationary  stochastic  process ,  -we  associate^  scattering 
matrix  S,  which  measures  the  interaction  between  the  past  and  future  . 
of  the  process.  This  matrix  valued  function  can  be  viewed  as  the 
generalized  phase  function  associated  with  the  spectral  density.  It 
determines  the  density  up  to  congruency  only  for  a  completely 
non- deterministic  sequence.  ^  <~ 

Kfoi-t  rX  j  7 

Using  the  theory  of  A4amjan-Arov-Krein  on  extensions  of  Hankel 
operators ,  -we  establ^hfithat  the  Hankel  operator  H£  determines  the 
Laurent  operator  Lg  as  its  unique  norm  preserving  lifting.  Employing 
the  Nagy-Foia§  theory  on  jijnitary  dilations ,  or  its  dual,  Lax-Phillips 
scattering  operator  model  ,^we  develop'' a  realization  theory  for  equivalent 
classes  of  stationary  sequences  with  the  same  density^  The  minimal 
equivalence  class  of  Markovian  representations  is  induced  by  the  coprime 
factorization  of  the  scattering  matrix.  This  presents  a  unified  approach 
to  stochastic  and  deterministic  realization  theory,  with  S  as  the  analog 
of  the  frequency  response  function. 

h,  *..tu 

To  obtain  reduced  order  models ,  we  approximates^the  given  sequence 
with  a  jointly  stationary  one  of  a  lower  dimensional  state  space, 
minimizing  the  distance  between  the  two  sequences.  The  solution  which 
involves  Hs  is  non-constructive.  We  pose  a  weakeryersion ,  leading  to 
a  Hankel  approximation  of  Hg.  The  algorithm  employs\the  analytic 
properties  of  Schmidt  pairs  for  a  Hankel  operator.  An^rror  bound  on 
the  normalized  difference  between  the  covariance  function^  of  the  two 
sequences  is  obtained.  \ 


4.'* 
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£ 


INTRODUCTION 


The  work  of  Adamjan-Arov  on  scattering  [2],  and  their  subsequent 
investigations  of  Hankel  operators  and  various  approximation,  inter¬ 
polation,  and  extension  problems  connected  with  them  [1],  [3]  -  [7], 
has  by  and  large  not  been  used  for  the  study  of  stationary  discrete  time 
stochastic  processes  (stationary  sequences).  Our  work  is  a  study  of 
the  ramifications  of  their  work  in  the  context  of  stationary  sequences. 

For  deterministic  linear  systems,  their  work  on  Hankel  norm  approximations 
has  recently  received  considerable  attention.  In  our  view,  this  is 
due  to  the  fact  that  the  input-output  approach  gives  the  Hankel  norm 
approximation  a  physically  appealing  interpretation  (see  Chapter  8). 

For  stationary  sequences,  however,  the  probabilistic  information  is 
available  in  the  form  of  the  spectral  density  fyy  of  the  p-dimensional 
sequence  {y(n»“oc  (or  equivalently  the  covariance  function)  and  there 
is  no  obvious  candidate  which  plays  the  role  of  an  input-output  map. 

This  is  now  well  known,  for  example,  in  the  context  of  stochastic 
realization  theory  (cf.  also  the  work  of  Willems  [42]).  Looking  for  an 
object  which  plays  the  role  of  an  input-output  map  has  some  conceptual, 
as  well  as  practical,  ramifications.  Conceptually,  it  presents  us  with 
a  unified  approach  to  deterministic  and  stochastic  realization  theory. 

On  a  more  practical  level,  it  provides  us  with  some  justification  as  to 
'what'  should  be  approximated  when  dealing  with  the  model  reduction 
problem. 
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In  this  study,  we  associate  with  every  centered  regular  full  rank 
stationary  process  y  a  scattering  matrix  S  which  measures  the  interaction 
between  the  past  and  future  of  the  process.  It  may  be  considered  to  be 
a  mapping  which  maps  the  backward  innovations  of  the  process  to  the 
forward  innovations  of  the  process  (see  Chapter  3).  In  the  1-dimensional 
case,  this  scattering  function  is  essentially  the  phase  function  of  the 
outer  factor  of  the  density  (and  hence  can  be  considered  as  the  phase 
function  associated  with  the  density) . 

The  question  as  to  when  the  phase  function  (for  a  1-dimensional 
process)  determines  the  density  of  the  process  (up  to  a  positive  scalar 
multiple)  was  investigated  by  Levinson-McKean  [28],  They  proved  that 
this  is  true  iff  (if  and  only  if)  the  process  has  the  completely  non- 
deterministic  property,  namely,  no  value  in  its  (strict)  future  space 
(that  is,  the  space  spanned  by  the  values  y(l) ,  y ( 2) ,  .  .  .)  can  be 
linearly  predicted  without  error  based  on  its  past  space  (that  is,  the 
space  spanned  by  y(0),  y(-l),  .  .  .  ).  This  property  is  stronger  than 
regularity  in  which  no  future  value  of  the  process  can  be  linearly  predicted 
without  error  based  on  its  past  space  [9].  For  the  vector  case,  not  only 
does  the  approach  in  [28]  not  generalize,  but,  in  addition,  it  is  not 
clear  what  is  the  right  notion  of  a  phase  (matrix  valued)  function  to 
associate  with  a  density.  That  our  scattering  matrix  S  can  be  viewed  as 
the  right  generalization  can  be  supported  by  the  fact  that  S  determines 
the  density  up  to  congruency  (that  is,  up  to  the  form  K*f^A)K)  iff  the 
process  is  completely  non-deterministic  (Theorems  4.5,  4.8).  Those 
scattering  matrices  which  correspond  to  regular  full  rank  completely 
non-deterministic  processes  are  characterized  in  Theorem  4.13. 

In  deterministic  systems  realization  theory,  for  a  causal  frequency 
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response  function  4>  we  have  the  general  principle  [23,  p.  24]  that  the 

Hankel  operator  H  (the  composition  of  the  reachability  and  observability 

maps)  determines  (the  causal)  q  up  to  an  additive  constant.  Assuming 

4>(0)  =  [0]  ,  the  general  principle  can  be  rephrased  by  saying  that 

Ha  admits  a  unique  causal  lifting  to  the  Laurent  operator  (input-output 

map).  In  the  case  of  stationary  sequences  the  Hankel  operator 

+ 

admits  a  unique  norm  preserving  lifting  (Theorem  4.4).  This 

property  can  be  given  a  variational  characterization ,  namely ,  S  is  the 

++ 

unique  solution  to 

minillFll^  :  Fe  LJB(CP) ) ,  ck(F)  =  cR(S)  ,  k  =  -1,  -2,  ...  } 

where  ck(F)  is  the  k-th  Fourier  coefficient  of  F. 

That  the  scattering  matrix  can  be  viewed  as  an  analog  to  the 
frequency  response  function  in  systems  is  further  strengthened  by  the 
role  we  show  it  plays  in  Markovian  representations  (realizations)  of 
stationary  sequences.  A  subspace  X  in  the  span  Hy  of  y  is  a  Markovian 
representation  for  the  sequence  {y(n)}_°^  if  the  process  {Un X 
(U  being  the  shift  on  H^)  has  the  (weak)  Markov  property  (Definition 
6.1),  and  y..(0),  ....  y  (0)  are  contained  in  X.  The  dynamical 

A  r 

representations  are  thereby  readily  obtained  (Sections  6,7). 

Ruckebusch  [37],  Lindquist-Picci  [23]  develop  a  realization  theory 
for  stationary  sequences  in  which  every  Markovian  representation  is  in 


+  In  the  terminology  of  [4]  ,  S  is  the  unique  minifunction  for  H  . 


Each  such  F  can  be  viewed  as  an  extension  of  Hg  [4].  The 
parametrization  of  all  those  extensions  seems  to  be  strongly  related  to 
the  covariance  extension  problem. 


1-1  correspondence  with  a  factorization  of  the  spectral  density. 


In  our  view,  having  the  spectral  density  as  the  only  probabilistic 
information  available  to  us  implies  that  two  stationary  sequences  in  H 

i 

having  the  same  density  (a.e.)  are  indistinguishable  (equivalent).  Thus 
two  geometrically  (and  probabilistically)  different  Markov  processes  in 


inducing  identical  dynamical  representations  (see  Theorems  6.12, 


7.1) 


should  be  considered  equivalent.  This  equivalence  relation  between 


Markov  processes  is  available  to  us  via  the  characteristic  function  of 


Nagy-Foia§  associated  with  a  (completely  non-unitary)  contraction  or  its 
corresponding  (completely  non-unitary)  semigroup  of  contractions  [30]. 
Indeed,  we  demonstrate  that  for  a  subspace  XCH  the  (full  range, 
regular)  process  {UnX}J^is  (weakly)  Markov  iff  the  state  transition 
operator  A ,  which  is  the  compression  of  U  to  X ,  has  U  as  its  minimal 


unitary  power  dilation  in  (Proposition  6.2),  and,  moreover,  A  is  of 

class  C00  (Corollary  6.5).  Thus  {An)n>0  *s  a  cqo  semigroup  of 
contractions.  The  equivalence  relation  between  Markov  processes  will 


be  based  upon  this  Markov  semigroup.  In  view  of  the  dual  relationship 
between  the  Lax-Phillips  scattering  operator  model  and  dilation  theory 
[2]  ,  there  is  set  up  a  1-1  correspondence  between  a  Markov  process 
{L^X}  ^  and  a  Lax-Phillips  scattering  system  (Theorem  6.4).  According 
to  a  fundamental  result  of  Nagy-Foia$  [30,  Th.  VI.  2. 5]  the  characteristic 
function  of  a  Cqq  contraction  induces  a  model  space  (the  functional 
model),  and  under  the  so-called  (Nagy-Foiaf)  incoming  spectral 
representation,  the  action  of  this  contraction  on  the  model  space  is  the 


restricted  shift  (the  model  operator).  Thus,  two  contractions  having 


0  strongly. 
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+ 

coinciding  characteristic  functions  are  unitarily  equivalent ,  and  so 
are  their  corresponding  semigroups.  Noting  (as  was  established  in  [2]) 
that  the  characteristic  function  of  A  coincides  with  the  scattering  matrix 
0X  of  the  aforementioned  scattering  system  induced  by  {UnX}_°^,  we 
establish  in  this  way  an  equivalence  relation  between  Markov  processes 
(thus  abandoning  the  1-1  correspondence  with  their  scattering  systems) 
which  is  based  upon  the  coincidence  of  their  scattering  matrices 
(characteristic  functions),  that  is,  the  unitary  equivalence  between  their 
corresponding  Markov  semigroups.  This  enabled  us  to  demonstrate  that 
two  stationary  sequences  are  equivalent  iff  they  possess  identical 
dynamical  representations  induced  by  equivalent  Markov  processes  which 
represent  them  (Theorems  6.12,  7.1).  We  next  show  (Theorem  6.10) 
that  there  exists  a  1-1  correspondence  between  the  equivalent  classes  of 
(full  range,  regular)  Markov  processes  and  the  purely  contractive 
analytic  functions  ?eHoo(B(Cp))  which  are  inner.  Among  those  inner 
functions  we  wish  to  choose  those  and  only  those  which  are  (via  the 
above  correspondence)  a  representation  for  the  stationary  sequence 

OO 

{y(n)j-ooor  a  sequence  equivalent  to  it.  This  is  accomplished  in  Theorem 

7.2.  Thus,  the  desired  family  of  inner  functions  are  those  and  only 

those  £ ,  which  factor  the  scattering  matrix  S  :  S  =  ^  H^JBtC^)) . 

* 

The  left  coprime  factorization  S  =  QjQg  induces  the  minimal  equivalent 
+++ 

class  (Theorem  7.3).  This  is  in  direct  agreement  with  deterministic 
systems  realization. 


+  Definition  6. 8. 

This  function  coincides  with  the  structural  function  of  Lindquist- 
Picci  [29]. 

A  corresponding  result  holds  for  the  right  factorizations. 
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Having  associated  an  inner  function  0  to  each  Markov  process 
{UnX}_°^,  there  is  a  natural  degree  associated  with  it,  namely,  d(0)  = 
det  0.  +  Accordingly  we  obtain  d(Q^)  <  d^),  that  is,  the  minimal 
class  is  of  the  weakest  degree.  Our  approach  enables  us  (by  inspection) 
to  derive  the  fact  that  a  Markovian  representation  is  minimal  iff  it  is 
observable  and  constructible ,  and  exact  observability  and  constructibility 
hold  iff  range  is  closed. 

The  scattering  approach  was  found  to  be  useful  in  various  prediction 
and  interpolation  problems  of  stationary  processes.  This  was  demonstrated 


by  Adamjan-Arov  in  [3].  They  consider  a  general  situation  in  which 

++ 

for  two  jointly  stationary  1-dimensional  processes  £,n  we  are  given 

the  past  H  (0)  of  n,  and  the  future  Ht(m)  (m  >  0)  of  £,  and  we  wish  to 

*1  ^ 

linearly  predict  n(k) .  k  >  0  based  on  H  (O)VHt(m).  This  general  set-up 

ni 

is  applicable  to  prediction,  interpolation,  and  filtering.  By  posing  the 
problem  in  terms  of  incoming  and  outgoing  data,  the  corresponding 
scattering  matrix  can  be  computed  and  the  predictor  expressed  in  terms 
of  this  scattering  matrix.  Thus,  generally  speaking,  this  approach 
transforms  the  projections  in  the  space  of  values  of  the  processes  into 
the  corresponding  ones  in  L ^  in  which  a  solution  can  be  obtained. 

Another  area  in  which  the  scattering  matrix  appears  to  have 
considerable  appeal  is  in  model  reduction.  As  the  only  accessible  data 


is  the  spectral  density  f 


we  would  ultimately  be  concerned  with 


obtaining  rational  approximations  to  this  density.  We  formulate  (Chapter 


8)  two  problems  in  model  reduction  both  of  which  involve  the  Hankel 


operator  H<..  In  Problem  A  we  wish  to  approximate  the  p-dimensional 


+  Its  scalar  multiple  in  the  terminology  of  [30]. 

As  they  point  out,  their  approach  generalizes  to  the  vector  case. 


process  y  with  a  p-dimensional  process  y  of  reduced  order  m ,  in  such 
a  way  that  the  'distance'  between  them 
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e  =  sup{  ||y.(n)  -  y.(n)j|„  :  j  =  1 . p,  n tZ) 

y 

is  minimized.  The  motivation  for  Problem  A  stems  from  the  fact  that  for 
the  covariances  {Cn)_“  ,  {Cn}“ro  of  y,  y  we  then  obtain 


C  -  C  I  <  2 

n  n  1 


^ ^  P  [  X)  a 


i  £ 


i-  “vj 


We  give  a  solution  (generally  non- unique)  to  the  above  problem  (Theorem 
8.2)  which,  however,  can  not  be  obtained  in  a  constructive  fashion. 

This  problem  leads  naturally  to  a  weaker  version  of  finding  an  inner 
function  Q^H^BtC^))  of  degree  d(Q)  <  m  such  that  jjH^s|j  =  min. 
(Problem  B).  This  is  a  Hankel  approximation  problem  in  disguise.  For 
the  p  =  1  dimensional  case  we  obtain  a  constructive  solution  in  the 


following  fashion:  We  consider  the  rank  m  Hankel  appro ximant  H 
to  H  .  The  desired  function  q  is  obtained  by 


m 


H2  ©  qH2  =  range  Hc 


rm 


and  is  constructed  from  the  Schmidt  pairs  corresponding  to  the  m-th 
singular  value  s  (H  ).  On  the  Markov  subspace  corresponding  to  q 

m  o 

we  project  y(0)  and  thereby  obtain  a  process  y  whose  density  f~~  is 

y  y 

of  degree  at  most  m ,  and  for  its  moments  we  obtain  the  bound 
(Proposition  8.5) 


The  vector  case  can  not  be  treated  in  generality  since  is 


in  general  not  necessarily  strictly  non-cyclic. 


m 
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<2sm(Hg)  ,  n  =  0,  ±1 . 

In  closing  we  mention  that  various  properties  of  stationary  processes 
were  found  to  be  reflected  in  (or  S),  such  as  the  strong  mixing 
property,  complete  non-determinism,  and  strict  non- cyclicity. 
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CHAPTER  TWO 
NOTATION 


Z  stands  for  the  set  of  integers ,  6(n)  for  the  indicator  function  of 
{0 }CZ,  C  the  complex  numbers,  and  for  acC  a  denotes  the  complex  conjugate 
of  a.  For  a  matrix  A  =  (a-)^  we  denote  by  A*  the  Hermitian  conjugate 


i  0=1 


A  Q 

of  A  :  A  =  (b..)F  b..=  a..,  and  by  A1  its  transposition.  For  a  family 

3  i,j=l  13  31 

of  subsets  {M.}  of  a  Hilbert  space  H,  we  denote  by  VM.  the  smallest  closed 

’  i  1 

linear  manifold  (subspace)  that  includes  each  M. ,  and  by  AM.  the  greatest 

L3 

subspace  contained  in  each  of  them  (their  intersection).  M.  denotes  the 
closure  of  M.  in  H.  For  subspaces  M  ,N  of  H,  M0N  denotes  the 
orthogonal  complement  of  N  in  M.  For  a  countable  family  {M.}  of  mutually 

_  3  j 

orthogonal  subspaces  :  M.  ■*•  M.  i  #  j  ,  we  let  ^  ©M.  be  their  orthogonal 

1  ]  3 

sum.  P^j  stands  for  the  orthogonal  projection  of  H  onto  the  subspace  M. 
For  a  bounded  linear  operator  A  :  H^  -*■  of  Hilbert  space  Hj  into  H2> 
we  denote  by  [A]  the  matrix  of  A  with  respect  to  specified  orthonormal 
bases  in  Hj,H^.  Aj  M  stands  for  the  restriction  of  A  to  the  subspace 
McHj.  B(H1,H2)  denotes  the  Banach  space  of  all  bounded  linear  operator 
from  Hj  into  H2  with  B(H)  =  B(H,H). 


By  l2(-“>,  N)  we  denote  the  usual  Hilbert  space  of  sequences  {h.}°° 

'  «  2  3  3="°° 
with  values  in  (the  Hilbert  space)  N  for  which  £j|h.  ||  <  00 . 

j  3  N 

12(0,«°;  N),  l2(-°°,-l;  N)  are  seen  naturally  as  subspaces  of  12(-00  *  00 ; 
L2,Loowill  denote  respectively  the  Lebesgue  spaces  on  the  circle 
T  =  (e1/1v  :  X«l— tt  ,tt] }  (with  respect  to  the  normalized  Legesque  measure 


)  of  square  integrable,  essentially  bounded  complex  valued  functions. 

Each  function  can  be  viewed  as  defined  as  [- tt ,  n  ] .  Similarly  for  the 

spaces  L2(CP),  Lm(CP)  of  functions  f  taking  values  in  Cp  for  which 

||f(*)(|  eL  ,  ||f(»)||  g  respectively.  LJ.B(CP))  is  defined 
Cy  F 

analogously  for  weakly  measurable,  B(CP)  valued  functions  f  for  which 

ess.  sup{  ||f(eiA)  ||  :  A«[-tt,  tt]}  <  =°. 

J5(CP) 

H2  are  the  subspaces  of  L2  defined  by 


+ 

2  =  ' 

}f'L2 

1 

‘  2  TT 

/ f(elA)e  lnA  dA 

-tt 

=  o, 

n=  -1 ,  -2 ,  . 

2  = 

{f<L2 

1 

:  2  TT 

TT 

/ f(elA)e“lnA  dA 

-  TT 

=  0, 

n=  0,  1,  2, 

and  we  have  the  orthogonal  decomposition  L2  =  H+  ©  H2  .  Each  f«H2 
having  a  Fourier  series 


«.,A>  -  £ Vto* 
0 


generates  the  function 


g(z)  =  £vn 
0 

belonging  to  the  Hardy  class  H2  of  functions  g(z)  holomorphic  in  |  z  |  <  1 
and  such  that 


sup 
0<r<  1 


2tt 


i 


J 


<  oo 
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Moreover,  the  (a.e.  existing)  radial  limit  g(elA)  of  g(z)  equals  f(e^) 

a.e.  and  ||f|lL  =  ||g||H  •  The  function  g(z)  is  seen  as  the  analytic 
2  +  2 

extension  of  f‘H2  to  the  unit  disc  |zj  <  1  and  is  denoted  by  f(z).  We 

identify  H*  with  H2  and  denote  them  commonly  by  Using  the 

conjugation  with  respect  to  the  unit  circle  (z  -»■  i),  by  the  reflection 

z 

principle,  for  feH2CL2  the  function  f  defined  by  f(e^)  =  f(e^)  has  an 
analytic  extension  to  |z|  >1  :  f(^) ,  which  we  again  denote  by  f.  The 
space  H2  =  {f«L2  :  f«H2}  is  the  space  of  functions  feL2  having  an  analytic 
extension  to  the  exterior  of  the  disc  f(z)  and  we  have 

*  $ 

llfliL  =  SUP  -h  /lf(pelX>l 2  dX 
2  P>1  L  -  7T  J 

feH2  are  called  conjugate  analytic. 

Analogously  for  the  Banach  space  we  have  the  subspaces 
of  functions  feL^  having  an  analytic  extension  f(z)  to  |z|  <  1  with 

||f||  =  sup  |f(z)|  =  ||f  ||H  . 

Loo  |2|<1  Hoo 

Similarly,  for  the  Hilbert  space  L2(cP)  we  have  the  subspaces  H2(cP)  = 
H2(cP),  H~(cP)  with  the  orthogonal  decomposition  L2(cP)  =  H2(cP)  © 
Hl(cP).  In  L  (B(CP) ) ,  again  H  (B(CP))  is  defined  as  the  subspace  of 

functions  in  L  whose  negatively  indexed  (matrix  valued)  Fourier 

00 

coefficients  vanish.  For  QeH  (B(CP))  the  function  0*  defined  by 

00 

9*(ei^)  =  [0(e^)]*  is  identified  with  its  analytic  extension  0*(| )  = 

10(4)1*  to  j  z  |  >  1. 

z 


A  function  feH^  is  called  inner  if  |f(e1A)|  =  1  a.e.  .  Similarly  for 

OeHJB(CP))  if  0(e^)  is  unitary  a.e.  .  feH,  is  called  outer  if  V{xnf)  = 

Hg  where  x  denotes  the  function  on  T  defined  by  x(e*  )  =  e  .  For 

the  Toeplitz  operator  :  H2(CP)  ^(C1*)  whose  matrix  is 

block  Toeplitz  with  respect  to  the  standard  basis  {e^e.  ,e*^e  . . .  ,e^e  } 

1  p  k 

{ej,e2,*..  .ep}  being  the  standard  basis  in  Cp ,  is  defined  by  T^f  = 
ff+(4>f)  where  tt+  is  the  Riesz  projection  of  L,2(CP)  onto  HgtC^).  will 

denote  the  Hankel  operator  [with  block  Hankel  matrix  with  respect  to  the 
standard  bases  in  H2(CP),  H“(CP)]  H^:  H2(Cp)  H“(CP)  defined  by 
H^f  =  n_(<Pf),  tt_  being  the  Riesz  projection  of  L2(CP)  onto  H2(CP).  The 
convention  we  employ  regarding  a  Hankel  operator  as  acting  from  H2(CP) 
into  H2(Cp)  is  not  in  accordance  with  the  one  employed  in  systems  theory, 
in  which  we  act  on  H2(CP)  into  HgCCP)  :  H^f  =  TT+(<j>f).  It,  however, 
conforms  with  the  one  employed  by  Adamjan-Arov-Krein  and  enables  us 
to  use  their  results  without  modifications,  as  well  as  to  refer  to  them. 


CHAPTER  THREE 


THE  SCATTERING  OPERATOR  MODEL  AND  THE  SCATTERING 
MATRIX  ASSOCIATED  WITH  A  STATIONARY  STOCHASTIC  PROCESS 

Let  H  be  a  complex  separable  Hilbert  space  and  let  U  be  a  unitary 
operator  on  H.  A  subspace  D+  is  said  to  be  outgoing  for  (U,H)  if  it 
satisfies 

(i)  UD  +  c  D  + 

(3.1) .  (ii)  AUnD  =  (0} 

— oo  T 

(iii)  VUnD  ,  =  H  . 

-OO  ' 

A  subspace  D_  for  which 

(i)  U*D_  C  D_ 

(3. 1) _  (ii)  _JunD_  =  ^ 

(iii)  VUnD_  =  H 

-OC 

is  said  to  be  incoming  for  (U,H). 

3.1  Definition.  A  quadruple  (U,H,D  +  ,D_)  satisfying  (3.1)  is  said  to 
be  a  scattering  system. 

We  shall  be  interested  in  a  scattering  system  arising  in  the  following  way. 
Let  (Q„V,P)  be  a  fixed  probability  space  and  let 

/yl(n)\ 

(y(n)  :  neZ }  ,  y(n)  =  |  y2(n)  | 

\vn)/ 
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be  a  centered  stationary  process  with  y.  (n)^  P)  j  =  l,  . ..,  p. 
Let  f  (X)  =  (f..(X))p  ,  Xe[-iT,TT]  be  its  spectral  density  satisfying 

yy  kJ  k)j=l 

1  f* 

(3.2)  J  log  det  f  (X)  dX  >  , 

-TT  -  - 

i.e.,  the  process  is  regular  and  of  maximal  rank. 

Let 


H  =  H  =  V  (y.(n) ,  y„(n)  , 
y  neZ  1  1 


yp(n)  }c  ^  2(a,.  r/,P) , 


be  the  space  spanned  by  the  process  and  let  U  be  the  unitary  shift 
operator  on  H  associated  with  the  y  process  [36,  p.  14]: 


Uy.(n)  =  y.(n+l)  j  =  l,...,p,  n  eZ  . 

We  consider  the  past  and  future  of  {yCn)}”^  defined  by 


D_  =  H'(0)  =  V{y  (k),...,  y  (k) }  ,  D  i  H+(0)  =  V{y.(k),...,  yn(k)}. 

y  k<0  1  p  y  k>0  1  p 

By  (3.1)  it  follows  [36,  Th.  II. 6.1] 

AUnD  =  {0}  =  AUnD  . 

-OO  _  00  + 

We  readily  obtain  that  (U,H,D  +  ,D_)  is  a  scattering  system. 

3.2  Theorem  (Translation  Representation  Theorem  [27,  Th.  II.l.l]). 

Let  (U,H,D+)  be  outgoing.  Then  there  exists  a  Hilbert  space  N+  and  a 
unitary  map  r+  of  H  onto  N+)  such  that 


l  *L  »  1  « 


(i)  r+[D  +]  =  12(0,oo;N+)  , 

(3.3) 

(ii)  U+  =  r+Ur~1 

is  the  right  shift  operator  on  12 (—  “»°°;N+) .  This  representation  is  unique 
up  to  automorphisms  of  N  . 

Proof  (cf.  [2,  p.  77]).  By  (3.1)  +  -ii  the  operator  U{D  +  is  an  isometry 
having  no  unitary  part.  By  Wold's  decomposition  theorem  [30,  Th.  1.1.1] 
we  may  write  uniquely 

CO 

(3.4)  D+  =  £©UnN+  N+  =  D+©  UD+  . 

n=0 

Since  for  any  m  >  0 

IT)— 1  . 

U"mD,  =  U~m[(D,©  L^DJ®  [/"DJ  =  U  mf(  £  ©  UKN  )  ©  UmD  ]  = 

k=0 

-m  . 

=  (  E  ©  UKN  )  ©  D.  , 

k=-l 

we  obtain  by  (3.1  -iii)+  that 

PC 

(3.5)  H  =  £©  UnN+  . 

—  00 

It  follows  that  for  arbitrary  heH 

h=f>UnPNU‘nh  ,  ||h||2=  E  llpN  u_nh II h  • 

—  OO  +  “00  + 


Hence  the  map 


r+  :  H  -  l2(-«,  »;  N+) 


defined  by 


(3.6) 


r  h  =  {PM  U"nh}“ 

+  N .  Jn=-C 


is  isometric.  Since  for  (li  }°°e  10(- «,  <*>;  N  ) ,  h  =  T]unh  eH,  the  map  r 

n  -oo  ^  +  rs,  n  c  + 

is  onto  and  thus  unitary.  By  (3.4)  we  obtain  (i).  From  (3.6) 


r  Uh  =  PKT(U  (n_1)h)  =  U  (r  h)  . 

+  N+  n=  +  + 


and  (ii)  follows.  By  (3.5)  U  is  a  bilateral  shift  of  multiplicity  equal  to 
dim  N+  and  the  uniqueness  follows.  -4 

3.3  Definition.  The  representation  (U+,  12 ( 0 ,  »;  N+) ,  l2(-oo,  co;  N+))  is 
called  an  outgoing  translation  representation. 

For  (U,H,D_)  incoming  we  similarly  obtain 


(3.7) 


d_  =  ^2  ©  unN_  N-  =  D_  O  U*D_  , 


(3.8)  H  =  ^  ©  uRn_  • 


For  the  corresponding  map  r_  of  H  onto  l2(-o>,  N_)  we  define 


is  the  right  shift  on  lg (_  00 ,  °°;N_).  The  representation  (U_,  l2(-°o,  ~1> 

l2(-°°,  00 ;  N_))  is  called  an  incoming  translation  representation. 

Now  let  (U,H,D  +  ,D_)  be  the  scattering  system  associated  with  the 

* 

regular  maximal  rank  y  process.  The  subspace  N_  =  D_  0  U  D 
(N+  =  D+G  UD+)  is  the  forward,  (backward)  innovation  subspace  at  n 
Since  for  a  scattering  system  (U,H,D  +  ,D_)  we  have 

dim  N_  =  multiplicity  U  =  dim  N+  , 

we  can  arrange  the  maps  r+  to  be  onto  l2(-“,  °°;  Cp). 

3.4  Definition  ([2], [27]).  The  operator 

S  =  r_r+  1  :  l2(-°°  >  06 ;  CP)  •>  l2(-°°.  Cp) 

is  called  the  abstract  scattering  operator. 

Clearly  S  is  unitary.  Denoting  by  V  the  right  shift  on  l^-00  ,  00;  Cp), 
we  readily  obtain  by  the  translation  representation  theorem. 

(3.10)  SV  =  rr^V  =  rUr’1  =  VS  . 

Let  F  :  l2(_co.  CP)  -*■  L2(CP)  be  the  Fourier  transform  operator.  Th 
unitary  operator 
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thus  commutes  by  (3.10)  with  L  ,  the  operator  of  multiplication  by  X. 

X 

It  follows  [11]  that  FSF-1  is  a  Laurent  operator  L^,  SeLJlB(CP))  such 
that 


S(elX) 


a.e. 


is  a  unitary  map  on 


CP. 


3.5  Definition  ([2]  ,[27]).  S  is  called  the  scattering  matrix. 

It  is  clear  from  the  translation  representation  theorem  that  S  is  determined 
to  within  right  and  left  multiplication  by  unitary  transformations  on  Cp 
(i.e.  ,  to  within  coincidence,  see  Definition  6.8). 

We  next  compute  the  scattering  matrix  S  for  the  y  process.  Let 

{  v.  (0) ,  ...,  v  (0) }  be  an  orthonormal  basis  for  N_.  Let  (n)  = 

^  P  3 

UnVj(0)  and  define 

/  vi‘n)  \ 

v(n)  =  (  ;  )  neZ  . 

U(n)/ 

By  (3.8),  the  process  { v(n) } ”„  is  a  (centered)  white  noise  process 
with  covariance  R.  (n)  =  <5(n)I  _  ,  constituting  the  forward  innovation 
process  for  the  jr  process.  It  is  determined  up  to  a  choice  of  basis  in 
N_.  By  (3.7),  we  may  write 


00 

(3.11)  y  (0)  =T^A(k)v(k)  A(k)  =  (a..(k))p  A(k)  =[0],k>0. 
^  1]  i,j=l 


(Wold's  representation  cf.  [36,  p.  56]).  It  follows  from  (3.9) 

P 


r_y.(0) 


=  lAaj 

m=l  1 


1m(k+l)  v  (0)1 
)m  m  /  k=  -» 
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Identifying  N_  with  cP  we  readily  obtain  the  representation 


(3.12) 


^ajjCk+1)  x 


r-yi(0)  -i\.  •»«> 


IP 


)! 

k=- 


Consider  the  function 


uu 

A(z)  =  ^  A'(k)zk 


Since 


2  1  laij<k>|2  *  2  i!yj(0)  Hh  <  00 , 


k=-°o  i,j=i 


i=i 


A(z)  is  analytic  in  I  z  I  >1.  For  A(z)  we  have  [36,  p.  57] 


(3.13)  A*  (z)A(z)  =  fyy(A) 


By  the  incoming  properties 


H9(CP)  =  V  {eln>  A(elA)a  :  aeCP }  , 
n<0 


i.e.  ,  A  is  conjugate  outer  [21,  p.  121].  From  (3.12) 


(Fr_y^(0) ,  ...  Fr_y  (0))  =  yj 


Since  the  translates  (in  H^)  of  yj(0),  ....  y^tO) 
combinations  are  dense  in  H  ,  Fr_  is  determined  by 


and  their  linear 
the  above  expression. 


We  now  consider  the  outgoing  representation.  Let  (Ej(0) . 

be  an  orthonormal  basis  in  N  .  We  similarly  obtain 


y°» 


00 

y(0)  =  ^  B(k)e(k) 

—  00 


B(k)  =  (0..(k))P  ,  B(k)  =  tO]  ,  k<( 

i.)=l 


This  representation  constitutes  the  representation 
the  backward  innovation  process  {^(n)}^  ,  £(n) 

We  define 


of  y(0)  in  terms  of 
\cp<n)/ 


r(z> 


oo 


2B'(k)zk 

o 


which  is  analytic  in  jz  |  <  1.  In  a  similar  fashion  to  (3.13),  we  obtain  by 
direct  computation 

(3.15)  ~^r*(z)r(z)  =fyy(A)  z  =  e1A  , 

with  T  being  outer.  Also 

(3.16)  (Fr+yx (0) ,  .  .  .  ,  Fr+yp(0))  =  r  . 

Combining  (3.14)  with  (3.16),  we  obtain 

sr  =  xa 


and  thus 


S  =  XAT 


-1 


i  > 

Using  (3.13)  and  (3.15),  one  easily  verifies  that  S(e  )  is  unitary 
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a.e.  Xe[-Tt,  tt]  .  We  thus  obtained 

3.6  Theorem.  For  a  regular  maximal  rank  process  {y(n)}  we  have 

(3.17)  S  =  XAT-1  , 

where  S  is  determined  up  to  left  and  right  multiplication  by  constant 
unitary  matrices. 

For  the  case  p  =  1  we  have 

3.7  Corollary.  For  a  regular  process  {yfn)}00^ 


(3.18)  s  =  "x-jr 


and  s  is  determined  up  to  multiplication  by  a  constant  of  unit  modulus. 


Proof.  The  outer  function  A  satisfies  |A  |  =  |r  |  on  T  and  thus  A  =  y  r 
a.e.  where  Y  is  a  constant  of  unit  modulus. 


3.8  Remark.  The  scattering  matrix  S  was  defined  by  an  outer  and 
conjugate  outer  factors  of  the  density  f  .  Since  those  are  determined 
up  to  left  multiplication  by  a  constant  unitary  matrix,  we  may  wish  to  make 
a  canonical  choice  (which  amounts  to  choosing  specific  orthonormal  bases 
in  N+,N_)  in  the  following  fashion:  For  T(0)  we  consider  its  polar 
decomposition  T(0)  =  KP  (K  unitary,  P  >  0)  and  define  ^(z)  =  K_1T(z). 

For  ^  we  have  r^(0)  >  0  and  (3.15)  holds.  This  r  is  unique.  Similarly 
for  A.  In  this  way,  the  density  f  will  have  a  unique  S  associated  with 
it.  From  the  viewpoint  of  seeing  S  as  the  phase  function  associated  with 
f  ,  this  may  be  appealing.  Note,  however,  that  S  measures  the 


1 


interaction  between  the  past  and  future  of  the  process  £  (see  discussion 
following  theorem  4.5)  and  uniqueness  in  the  reverse  direction  (from  S 
to  f  )  is  not  possible. 

yy 


CHAPTER  FOUR 


THE  INDUCED  HANKEL  AND  TOEPLITZ  OPERATORS 

We  call  the  unitary  maps  F_  =  Fr_,  F+=  F r+  -  the  incoming  and 
outgoing  spectral  representations ,  respectively. 

4.1  Proposition .  We  have 

(i)  F_[D_]  =  H‘(Cp)  . 

(ii)  F+[D+J  =  SH2(CP)  , 

(iii)  F_(Uh)  =  ■  F_h  heHy  . 

Proof,  (i)  and  (iii)  follow  from  the  properties  of  the  incoming  translation 
representation.  By  the  definition  of  S  we  obtain 

F_[D+]  =  (Fr.r^F*)  Fr+[D+]  =  SH2(CP)  . 


4.2  Lemma.  The  operator 


P_P+  :  D+  -*•  D 


P  =  P 


D 


is  unitarily  equivalent  to  the  Hankel  operator  H  ,  and  the  operator 


P  .  p+  =  D  -  D. 
D 


is  unitarily  equivalent  to  the  Toeplitz  operator  TQ. 


Proof.  For  heD+  and  f  =  F+h  we  have 

F(P_P+)F*  f  =  F_P_h  =  F  P_ F*F_F*f  =  (F_P_F*)  Sf  . 

Since 

F_P  F*  =  TT_ 

we  obtain 

F_(P_P+)  F+*f  =  figf  fcH2  , 

and  the  following  diagram  commutes 


The  unitarity  of  F_,F+  implies  that  P_P+  is  unitary  equivalent  to  H 
This  proves  the  first  part.  Now  since 

FdH-P-)F+f  =  F_F*f  "  F-P-F+f  =  S{~  *-s{  =  \Sf  =  Tsf 


the  following  diagram  commutes 


T 


which  proves  the  second  part. 


4.3  Lemma.  For  Hg,  Tg  we  have  the  identity 


HS  HS  +  TS  TS  IH2(d?)  * 


Proof.  Clearly , 


Hs  g  =  1t+(S  g) 


geH'(CP) 


It  follows  that  for  feH2(CP): 


HSHSf  =  =  V*t(IL  (CP)_7T+)Sf]  = 

2 


=  7T+S*Sf  -  v+S  r+Sf  =  f  -  T*  Tsf  . 


By  a  theorem  of  Nehari  [31]  (a  vector  generalization  of  which  was  obtained 

by  Sarason  [41]),  for  a  bounded  Hankel  operator  H  ,  there  exists 

9 

a  function  <b  eL  such  that  H  =  H  and 

Yp  00  <ptp 


h  II  =  |U„ 


(4.1) 


9 


OO 
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Since  always  ||H^[|  <  ||<J)^||00  because  of  (4.1),  <f>  is  called  a  minifunction 
for  H  [5,  p.  6].  The  question  regarding  the  uniqueness  of  the 

V 

minifunction  is  of  particular  interest  to  us.  In  system  realization  theory 
for  a  frequency  response  function  ^eH^,  (same  reasoning  holds  for  the 
vector  case),  a  central  role  is  played  by  the  Hankel  operator  : 

H2  -*■  H2,  H^f  =  TT+(0f).  Now  observe  that  if  (^,4>2eH00  are  such  that 

then  =  0,  and  the  positive  Fourier  coefficients  of  ^ 

vanish.  Thus  <J>j- $2  =  const,  and  determines  (the  analytic)  <f>  up  to 
an  additive  constant  (recall  in  this  regard  that  the  composition  of  the 
reachability  and  observability  maps  determine  the  frequency  response 
function  up  to  an  additive  constant).  However,  in  general,  (4.1)  does 
not  hold  and  the  uniqueness  of  <t>  inducing  is  guaranteed  by 
analyticity  (causality) .  In  view  of  the  central  role  played  by  S  in 
realization  of  stationary  sequences  (Chapter  7) ,  the  following  theorem  is 
of  significance: 

4.4  Theorem.  The  Hankel  operator  Hg  determines  S  uniquely.  Indeed, 

S  is  its  unique  minifunction. 

Proof.  From  Lemma  4.3  we  note  that  ffKerTg  iff  f  is  an  eigenvector  of 
H^HS  corresponding  to  the  eigenvalue  [|Hg||  =  1.  Since  S  =  xAT 
every  column  of  F  belongs  to  this  kernel.  Thus,  the  projection  of  the 
above  eigenspace  on  the  first  coordinate  in  ^(O,00;^)  spans  T(0).  Now 
observe  that  for  F(0)  we  have,  because  of  its  outer  property  in  H2(£(CP)), 
(see  e.g.  (36,  p.  76] ) 


so  that  F  (0)  is  of  full  rank.  We  conclude  that  the  aforementioned 
projection  is  onto  the  first  coordinate  space.  According  to  a  result  of 


Adamjan-Arov-Krein  [4,  Corollary  3.1j  for  a  Hankel  operator  to 
have  a  unique  minifunction,  it  is  sufficient  that  the  projection  of  the 
eigenspace  of  corresponding  to  ||HjJ|  on  the  first  coordinate 

space  be  onto.  The  result  follows. 

There  is  an  alternative  way  to  rephrase  Theorem  4.4.  If  we 
consider  the  Laurent  operator  L  of  multiplication  by  <j>  on  Lg,  then 

9  Vi  u 

(4.1)  becomes 

IK  ii  ■  IK  11 

u 

Since 


H 


4> 


P 


H 


-L 

2 


$ 


one  considers  L  A  as  norm  preserving  lifting  of  H,.  Thus,  in  system 

yn 

theory,  the  uniqueness  of  the  lifting  is  guaranteed  by  causality,  the 
lifting  being  in  general  not  norm  preserving,  while  for  stationary 
processes,  the  uniqueness  is  guaranteed  by  the  lifting  being  norm 
preserving. 

Viewing  a  linear  time  invariant  system  from  the  input-output  point  of 
view  makes  the  frequency  response  function  the  sole  accessible  object 
containing  all  pertinent  information  about  the  system.  As  to  the 
information  contained  in  the  scattering  matrix,  we  have  the  following: 

4.5  Theorem.  The  scattering  matrix  S  determines  the  density  f  (A) 

-  jy 

up  to  the  form 


33 


(4.2)  K*f  (  X)K 

yy 

where  K  is  a  constant  pxp  non-singular  matrix,  iff 

(4. 3)  dim  Ker  T  =  p 

Proof.  First  note  that  for  any  representation  of  S 
S  =  XYX-1 

with  the  columns  of  X  in  H2(C^)  and  those  of  \Y  in  H2 (CP) ,  the 
columns  of  X  belong  to  Ker  T<..  Moreover  (on  T) 

Y*Y  =  (SX)*SX  =  X*X  . 

Assume  (4.3)  holds.  It  thus  follows  that 

(4.4)  X(elX)  =  T(elX)K 

where  K  is  a  pxp  full  rank  constant  matrix.  Thus, 

~X*(z)X(z)  =  ^KV(z)F(z)K  =  K*fyy()K  z  =  e1 '  , 

proving  the  'if  part. 

Now  assume  (4.3)  not  to  hold,  i.e. ,  dim  Ker  >  p  .  We  can  thus 
find  a  pxp  matrix  X(elX)  of  full  rank  a.e.  X  such  that  the  columns  of 
X  belong  to  Ker  T<,  and  (4.4)  does  not  hold.  If  we  define 


Y  =  XSX 


then  the  columns  of  ;<Y  are  in  H2  (C^)  and  S  =  XYX  with  Y  Y  =  X  X. 


The  result  follows. 


That  the  scattering  matrix  determines  f  up  to  the  form  (4.2)  is 

a  natural  consequence  of  the  scattering  framework.  Indeed,  for  an 

* 

arbitrary  non-singular  K,  the  process  £(n)  =  K  y(n)  whose  density  equals 


f  =  K  f  K 

ss  yy 


induces  the  same  scattering  system  (U,H,D  ,D_)  as  the  y  process. 
We  next  characterize  condition  (4.3)  on  a  process  level. 


4.6  Proposition.  We  have 


F*[KerTs]  =  H~  (O)AHy  (0) 


Proof.  Let  0  ?  feKer  T^.  From  Lemma  4.3  it  follows 


Hs  HSf  =  f 


i.e.  , 


IV”  = 


By  Lemma  4.2,  we  obtain  for  £  =  F*feH*(0) 


llP.Cll  =  lie 


and  ^eHy(O)  .  Thus 


F+[Ker  Ts]  c  H~(0)AH  +  (0) 


Now  let  r«H  (0)AH  (0).  It  follows  from  Lemma  4.2 

-  y  y 


Hs(F+0  . 

Let  f  =  F+^eH2(C^).  We  obtain 


l|Hsf  || 


FJ  II  =  II  C  ||  =  ||F+C 


HsV 


Thus  feKer  which  implies 


F+[H  (0)  A  H  ( 0 ) ]  C  Ker  Ts 


The  result  follows. 


By  the  unitarity  of  F^ 


dim  Ker  T^  =  dim  Hy(0)AHy(0) 


and  since  y  is  regular  and  of  full  rank  we  readily  conclude 


dim  Hy(0)AHy(0)  =  p  iff  dim  Hy  (0)AHy(l)  =  0 


4.7  Definition  [9].  The  process  y  is  said  to  be  completely  non- 
deterministic  if 


«y(0)AH  (1)  =  {0} 


36 


As  is  well-known  (see  e.g.  [36,  p.  73])  for  a  regular  maximal  rank 

process  {y(n)}°°  no  y.(k)  k^l,  j  =  1 . p  can  be  predicted  without 

error  based  on  the  past  H^(0).  Being  completely  non-deterministic  is 
more  restrictive;  indeed,  no  value  in  H*(l)  can  be  predicted  without  error 

j L 

based  on  H  (0). 

2 

Summarizing,  we  can  restate  theorem  4.5  in  the  following  way. 


4.8  Theorem .  The  scattering  matrix  S  determines  f  up 

j z  *L 

(4.2)  iff  y  is  completely  non-deterministic. 


to  the  form 


4.9  Remark.  It  is  of  interest  to  observe  that  since  for  a  completely 
non-deterministic  process ,  the  eigenvectors  of  H  corresponding  to 
||H„||  are  only  the  columns  of  F,  the  projection  of  this  eigenspace  on  the 
first  coordinate  is  not  only  onto,  but  also  1-1.  In  [4,  Sec.  2]  it  is 
shown  that  for  any  Hankel  operator  H  :  H2(CP)  H2  (CP)  satisfying  this 
condition,  its  unique  minifunction  is  of  the  form 


pS 


!|H|| 


Thus  up  to  a  constant  multiple  p  >  0  all  minifunctions  of  such  Hankel 
operators  are  in  1-1  correspondence  with  regular,  full  rank,  completely 
non-determinstic  processes. 


4.10  Example.  Let  {y(n)}^.have  rational  density 

,2 


fyy(x)  = 


I  P(Z) 
|Q(z)| 


z  =  e 


iX 


where  the  polynomials  P  ,Q  have  no  zeros  in  |z|  <  1  and  are  relatively 
prime.  Since  f^eL^,  the  polynomial  Q  has  its  zeros  in  |z|  >  1.  Write 


where  Pj^  of  degree  k  has  its  zeros  on  T  and  P2  in  |z  |  >1.  For 
k  k 

P-(z)  =  n  (z-a.)  ,  {a,}  C  T  we  have 

j=l  J  J  3=1 


P^e1*) 

P^e^) 


-ik\  ^_^k 


k 

n 

3=1 


Thus 

k+1  ^2  k  ^  - 

(4.5)  s  =Yx  1  -r  Ve  =  -£■ ,  Y=  (-1)K  n  , 

\  Q  j=l  ] 

where  J  Y  |  =  1  and  is  outer.  In  [5]  Adamjan-Arov-Krein  show  that 
(4.5)  is  the  general  form  of  unimodular  minifunctions  and  that  in  this 
case  k+1  is  the  dimension  of  the  eigenspace  corresponding  to  the 
singular  value  1  =  ( [ Hs  ( {  [5,  Th.  2.2].  From  Lemma  4.3  it  readily 
follows  that  this  dimension  equals  dim  Ker  Tg  .  Thus  + 

dim  H~  (O)AHy  (0)  =  k+1  . 

We  conclude  that  a  regular  process  with  rational  density  is  completely 
non-deterministic  iff  it  has  no  zeros  on  T. 

00 

4.11  Example.  Let  (yln)}^  have  density 

0  <  m  £  f  (A)  ^  M  <  00  a.e. 

yy 

It  readily  follows  that  the  outer  factor  reHro  and,  moreover,  1/TcH^  . 


+ 


This  reproduces  a  result  of  Bloom  field- Jewel- Hayashi  [2,  Th.ll] 


Now  let  geKer  T  ,  for  arbitrary  feH2 

0  =  (Ts  g,f)  =  (xy  g,f)  =  ,  xrf)  . 

Since  r  is  outer  and  g/r«H2>  we  conclude 
g/„  h2  0  xh2  =  C  . 

Thus , 

Ker  Ts  =  {XT  :  XeC} 

and  dim  Ker  Ts  =  1.  The  process  y  is,  therefore,  completely  non- 
deterrninistic. 

We  now  formulate  a  converse  to  Theorem  4.5.  The  question  can  be 
posed  as  follows:  Under  what  conditions  is  a  function  )  a 

scattering  matrix  of  some  full  rank,  p  dimensional,  completely  non- 
deterministic  process.  On  an  abstract  level  we  first  observe  that  any 
SeL^fBCcP))  which  is  unitary  valued  a.e.  on  T  is  the  scattering  matrix 
of  the  canonical  scattering  system  [2] 

U  =  ,  H  =  L2(CP)  ,  D+  =  SH2(CP)  ,  D_  =  H2(CP). 

The  above  questions  amounts  to  characterizing  all  scattering  systems 

(U,H,D  +  ,D_)  for  which  there  exists  a  set  (Cj . £p}  of  linearly 

independent  vectors  such  that 

H  =  span  {Un  :  j=l,  ...,  p,  n=0,  ±1,  ...} 


40 

Since  Aj  =  tt_ ( s r ^ )  +  tt+(SI\)  =  7r_(Srj),  j=l,  ....  p,  the  columns  of 
A  =  [ Aj  |  A 2  !  ...  |  Ap]  are  in  H'(CP)  and  by  (i) 

A*(z)A(z)  =  T*(z)  T(z)  z  =  elX  • 

If  we  define 

fyy C A)  =  ±  T*(eiX)  T  (elX) 

the  theorem  follows  provided  we  show  that  T  is  outer  and  XA  conjugate 

outer.  Let  U  =  L,  and  define: 

X 

d_  =  v  {vnA  ,  ....  /nA  }ch;  (Cp)  ,  d  =  v  { xnsr . xnsr  }CSH„(CP) . 

n< -1  1  P  1  n>0  1  P  L 

Let 

(4.7)  H  =  (  V  UnDJ  V  (  V  uD+) 

ncz  "  nez 

It  is  easily  verified  that  (3.1)+  -  (i),  (ii)  holds  for  (U,D+).  In 
[2]  Adamjan-Arov  show  [2,  Th.  2.5]  that  a  quadruple  (U,H,D  +  ,D_) 
satisfying  (3.1)+-  (i),  (ii)  and  (4.7)  has  a  scattering  matrix  S  which 
is  unitary  valued  a.e.  on  T  iff 

V  UnD  =  H  =  V  UnD_ 

nez  neZ 

and,  moreover,  from  their  generalized  functional  model  [2,  Th.  2.1]  we 
need  have 

D_  =  H~2  (CP)  ,  D+  =  SH2(CP)  . 

A  straightforward  computation  (mimicking  the  one  in  section  3)  gives 


and  the  result  follows. 


4.14  Remark.  We  wish  to  comment  here  on  a  conceptual  point.  Nc 
that  the  fact  that  S  (or  H  )  are  used  subsequently  in  various  model 
reduction  problems,  and  realization,  is  in  no  way  contingent  upon  S 
determining  f^  .  Indeed ,  in  all  those  cases  we  use  explicitly  f  ii 
that  we  are  able  to  identify  Fy(0)  =  X/.  or  F  y(0)  =  r  . 


CHAPTER  FIVE 


BLOCK  MATRIX  REPRESENTATIONS 


In  order  to  economize  notation,  we  will  denote  the  matrix 

'5’  \  h  i 

I  .  Also,  as  before, 

p/ 


(^•Vh).  .  hy  <z>n>  •  i 

i ,  j=l ,  ■  •  •  ,p 


1\  /  1 

,  n  = 

\n 


U5  = 


Vh 

U5„ 


00  00  1 

Recall  that  {vCn)}  ^  ,  {Un)}M  are  the  forward  and  backward 

00 

innovation  processes  for  {yCn)}.^  .  Let 


Tij  =  *£(])<  l(i)> 


i.jeZ 


By  stationarity 


T..  =  <U]s(0), 


U\(0)>  =  <c(0),  U 


(0): 


=  t. 


i~j,0 


Denote 


(  =  X  ’ 

n  n+1,0 


n sz  . 


We  form  the  two-way  infinite  Toeplitz  block  matrix 


+  They  are  determined  to  within  a  choice  of  ortho  normal  bases  in 
in  N  ,N.  which  we  fix.  See  also  Remark  3.8. 

“  T 


A  = 


'  0 


c< 


) 


1 

Y 


1 


Y 


2 


Y 


2 


Y 


2 


From  the  definition  of  the  scattering  operator  via  the  innovations 
representations  we  obtain  for  the  Laurent  operator  (with  respect  to 
the  standard  basis) 


[Ls  )  =  A  . 

As  to  the  Toeplitz  operator  Ts ,  observe  that  with  respect  to  the 
orthonormal  bases 


(v,(n),  v9(n) . ->  (n)  ^  *  {e - (n) ,  e2(n), 

1  z  p  n>l 


P 


(n)} 

n  >0 


in  Dx,  D  the  operator  P  |D  has  the  representation 
+  D4- 


”<c(0),  v ( 1 ) >’  <£(1),  v(l)>’  <j:(2)  ,  \(1)>' 

'0  :-l  -2 

<e(0),  v(2)>'  <£(1),  v(2)>’  . 

Y1  ^0 

s  <-(  fn  \  . 

,  vvo;'7  . . . .  • 

2  ’ 

and  from  lemma  4.2 

!TS]  -  I 


Similarly,  for  Hc  we  note  that  with  respect  to  the  orthonormal  bases 
5 


{v,(n),  v„(n) ,  ....  v  (n)}  ,  (c,(k),  £9(k),  ....  £  (k)} 

12  p  n<0  1  2  p  k>0 

in  D_,  D+,  the  operator  P_P+  has  the  matrix  representation 


~  _ 

— 

<£(0),  v(0)>’  <£(1),  v(0)>'  <£(2),  v(0)>' 

Y-l  Tl2 

< £(0)  ,  v(-l)>'  < £(1)  ,  v(-l)>'  . 

'-2  -3 

<£(0) ,  v (—  2)> ' . 

V  . . 

'-3 

so  that  by  Lemma  4.2 
[Hs]  =  2  . 


Let 


i 

|Ls>  11 

i 

i 

i 

) 

i 

<y12 

[y  21 

-I — 
i 
i 
i 
i 

<y  22 

i 

be  the  partition  of  [L^]  corresponding  to  the  orthogonal  decomposition 
L2(CP)  =  H2(Cp)  ©  H2  (CP).  By  our  previous  considerations, 

^21  =  tHS]’  [LS]11  =  * 

[Lg]^  is  the  matrix  representation  of  the  Hankel  operator  mapping 
H2  (Cp)  —  H2(CP)  taking  f  •—  iT+(Sf).  Similarly,  the  Toeplitz  matrix  IL5122 
which  represents  the  map  H‘(CP)  —  H~(CP)  takes  f*-TT_(Sf).  Combining 
with  Theorem  4.4,  we  obtain 


5.1  Theorem.  The  scattering  operator  S  has  the  two  way  infinite 
Toeptitz  block  matrix  representation 


[Lsl  = 


1 

H 

Co 

i  * 

i 

i  i 

[Hg] 

i 

i  ** 

i 

i 

where  [Hg]  determines  [Lg]  uniquely. 

We  next  obtain  a  representation  for  [Hg]  in  terms  of  the  moments 


Cn  =  <y(n),  y(0)>  =  ft  U)elnX  dA  ( B(  Cp) ,  n  « Z  . 

-71  - - 


Using  the  representation  of  ^(n)  in  terms  of  the  backward  and  forward 
innovations  (Chapter  3) ,  we  obtain 


00  OO 

(5.1)  C  =  <y(n)  ,  y (0)>  =  <  ^  B(k)  e(k+n) ,  A(-m)v(-m)> 

n  k=0  — n 


m=0 


OO  CO 


■  2  I 


B(k-n)  Y' 


k=n  m=0 


(m+k+1) 


A  (-m) 


Direct  computation  gives 

OO  00 

(5.2)  ^B(k)B*(k+n)  =  Cn  =  A(-m) A*(-m+n)  n=0,  1,, 


k=0 


m=n 


Form  the  triangular  block  matrices 


In  accordance  with  (3.13),  (3.15)  we  call  B,  A,  the  outer,  conjugate 
outer  factorizations  of  the  Toeplitz  form  induced  by  the  moments 
{Cn>Q  (C_n  =  C*).  Combining  with  (5.1)  gives 


c'o 

c'  c ' 

■  ■  ■  ■ 

A*[HS]B  = 

c; 

c'2 

C2 . 

_ 
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CHAPTER  SIX 

MARKOV  PROCESSES  AND  UNITARY  DILATIONS 


In  a  Hilbert  space  setting,  a  centered  stationary  process  (x(n)}00 

—  C 

x(n)  =  /xi^n^  \  is  said  to  be  Markov  if  for  all  n  £  s 


xm(nV 


(6.1)  PHx(s)x  PX(s)X  xeHx(S)  , 

where  X(s)  =  span{x.(s)  :  j=l,  — ,  m}.  In  our  setting,  all  stationary 
processes  will  be  generated  by  the  shift  U  (on  H  )  associated  with  the  y 
process.  Thus,  for  a  stationary  process  (x(n)}^  (in  H^)  we  will  have 
x(n)  =  Unx(0).  It  readily  follows  from  (6.1)  that  one  can  define  the 
notion  of  a  Markov  subspace  with  respect  to  U  in  the  following  (see  [23]  , 
[37]). 

6.1.  Definition.  A  subspace  XCH  is  said  to  be  Markov  with  respect  to 
U  if  for  all  n>s,  xeX 

(6.2)  P  Unx  =  P  Unx  . 

vumx  uSx 


Thus  X  is  a  Markov  subspace  with  respect  to  U  iff  the  process  {UnX}~ 

has  the  (weak)  Markov  property.  In  what  follows  a  Markov  process 

n  co 

{U  Xj^will  invariably  arise  in  this  fashion. 


We  shall  be  interested  in  Markov  subspaces  (with  respect  to  U) 


XCH^  for  which 

(6.3)  {y j (0) . yp(0)}  cx  . 

In  this  case  X  satisfies 


00  n 

H  _  VUnX  , 

y  -  -oo 

and  we  say  that  X  (or  {UnX}~oo)  is  of  full  range.  There  is  a  direct 
relationship  between  Markov  processes  of  full  range  and  unitary  dilations 
Recall  [30]  that  a  unitary  operator  U  on  a  Hilbert  space  H  is  said  to  be 
the  minimal  unitary  (power)  dilation  of  a  contraction  A  on  XCH  if 


An  =  PxUn|X  n  ;>  0 

and  H  =  _VUnX  ( minimality ). 


6.2 

V 


Proposition.  XCH^  is  a  Markov  subspace  of  full  range  iff  U  (on 
is  the  minimal  unitary  (power)  dilation  of 


A  =  Px  U|X  :  X  X  . 


Proof.  From  (6.2),  we  obtain  for  x,x'eX  and  m,n>0 
(U_mx,  Unx')  =  (U'mx,  PxUnx)  . 


+ 


We  shall  subsequently  omit  the  phrase  "with  respect  to  U". 


Thus,  denoting  A(n)  =  PXU  |X,  we  obtain 


(x,  A(m+n)x')  =  (x,  Um+nx')  =  (U  mx,  Unx')  =  (U  mx,  PxUnx')  = 

=  (x,  PxUmPxUnx’)  =  (x ,  A(m)A(n)x’)  . 

We  conclude  that  A(m+n)  =  A(m)A(n)  and 

A(n)  =  An(l)  =  An . 

•  .  co  n 

Since  X  is  of  full  range,  i.e.  ,  K  =  VU  X,  we  conclude  that  U  in  H  is  the 

y  -oo  y 

minimal  unitary  dilation  of  A  (in  X).  This  proves  the  'only  if  part.  Th 
'if  part  follows  by  reversing  the  argument. 

The  semigroup  {An  }n>Q=  Pj^U^X^q  be  ca^ec*  the  ^,Qr^0V  semigroup 

Much  in  the  same  way  as  for  a  regular  process  we  make  the  following: 

6.3  Definition.  A  Markov  process  {UnX}“oQC  is  said  to  be  regular  if 

A  V  UnX  =  {0}  =  A  V  UkX 
n<0  k<n  n>0  k>n 

We  shall  occasionally  also  refer  to  (the  Markov  subspace)  X  as  being 
regular. 

The  notion  of  regularity  for  a  Markov  process  is  intimately  related  to  the 
asymptotic  stability  of  the  static  operator  A  (see  Corollary  6. 5  and 
Theorem  6.11) . 

The  correspondence  between  Markov  processes  and  scattering  systems  is 
established  in  the  following: 

6.4  Theorem.  Let  XCH  be  a  regular  Markov  subspace  of  full  range. 
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Then  Hy  decomposes  and,  moreover,  uniquely  into  the  orthogonal  sum 

(6.4)  H  =  D  ©  X  ©  D+ 

where  (U,  H^,  D+,  D_ )  is  a  scattering  system. 

Proof.  Define 

D+  =  (  V  UnX)  ©  X 

(6.5)  n-° 

D_  =  (  V  UnX)  ©  X 
n<0 


We  first  show 

(6.6)  UD+c  D+ 


Note  that 


D.  =  V  (I  -  PY)UnX  . 

+  n>0  Hy  X 

For  arbitrary  x.x'eX  and  n>0 

(U(IH  ~  Px)Unx,x')  =  (Un+1x  ,x’)  -  (UPxUnx,x’)  = 

(An+1x,x’)  -  (UAnx,x’)  =  (An+1x ,x')  -  (AAnx ,x')  =  0  . 

It  follows  that 


UDf*  X  , 

and  since  UD  c  V  UnX  we  conclude  (6.6).  Similarly,  we  obtain  U*DCD  . 
n^O 


To  prove  that  D_  D+  we  note  that  it  suffices  to  prove 


D.  ^  V  UmX  . 
m<0 


Since  for  arbitrary  x.x'eX,  and  n,m>0 


((IH  '  Px)Unx’  u~mx')  =  (Unx.  U_mx’)  -  (PxUnx,  U  mx')  = 

y 

=  (Am+nx,x')  -  (AmAnx,x’)  =  0 


the  conclusion  follows.  To  prove  that  A  UnD+  =  (0>  observe  that 


V  UnX  =  H  A  Unx"  =  {0 1  +*■  A  Un(D  ©  D  )  =  t0}^-> 

-oo  y  -oo 

(AU nD)  ©  (AUnD)  =  {0}  . 

-OO  -« 


By  regularity,  we  obtain  that  on  the  space  K  =  V  UnX  the  operator 

n>0 

U+  =  U|K+  is  an  isometry  having  no  unitary  part  (Wold's  decomposition 
[30,  Th.  1.1.1]).  Moreover,  from  the  structure  of  the  space  of  the 
minimal  isometric  dilation  (of  A)  [30,  section  II.  2],  we  obtain 


K  =  ©  U  n+1N_  N  =  D  ©  U*D 

n=0 


Since  X  is  of  full  range,  we  have  Hy  =  ^©  UnN_  and 


(6. 7)_  VUnD  =  H 

-oo  —  y 


In  a  similar  way, 


We  therefore  established  that  (U,Hy,D+,D_)  is  a  scattering  system.  To 
prove  uniqueness,  note  that  if 

Hy  =  Dl  ©  X  ©  D'+ 

!  f  1 

is  another  decomposition,  then  UX  UD_  D  D_  and,  thus  UX  C  X  ©  D+  . 
It  follows  D+C  X  ©  D+  ,  since  D+  X  we  obtain 


Similarly , 

D_  C  D'  . 


However,  D_  ©  D+  =  d'_<£)D'+  and  uniqueness  follows. 

6.5  Corollary.  A  full  range  Markov  subspace  is  regular  iff  + 

An -*■  0 ,  A*0^  0  (n-*-  °° ) 


strongly. 

Proof.  From  the  proof  of  the  last  theorem  it  follows  that  X  is  regular  iff 
(6.7)  holds.  Combining  with  [30,  Th.II.1.2]  gives  the  desired  result. 

6.6  Definition.  A  scattering  system  (U,H,D+,D_)  for  which 


+ 


For  a  finite  dimensional  X  both  convergences  are  equivalent. 


5 


is  called  a  Lax-Phillips  (L-P)  scattering  system. 

Let  {UnX}“oo  be  an  arbitrary  regular  Markov  process  of  full  range, 
and  (U ,Hy,D+,£Ox  its  associated  L-P  scattering  system.  Let  Qx(e^)  be 
the  corresponding  scattering  matrix.  For  the  corresponding  incoming 
spectral  representation  F x,  we  obtain  from  Proposition  4.1 

FX[D_]  =  H~  (CP)  ,  [D+]  =  0xH2(Cp)  . 

Since  D+MD_ 

OxeEJB{CP))  . 


To  each  regular  full  range  Markov  process  there  is  thus  associated 
an  inner  function  0X>  which  is  the  scattering  matrix  of  the  corresponding 
L-P  system  (U,H  ,D+,D_)X.  According  to  Proposition  6.2  and  Theorem 

y 

6.4,  characterizing  those  inner  functions  amounts  to  characterizing  all 
those  L-P  systems  (U,H ,D+,D_)  for  which  U  on  H  is  the  minimal  unitary 
dilation  of  the  contraction  A  =  PXU|X,  X  =  H  ©  (D_  ©  D+) .  By 
[27,  Th.  III. 1.1],  U  on  H  is  a  dilation,  and  combining  with  [2,  Th.3.3], 
we  obtain 


6.7  Proposition.  The  scattering  matrices  0  associated  with  regular  full 
range  Markov  processes  are  those  and  only  those  inner  functions 
OfH^IKcP))  for  which 


(6.8)  MO(O)  ||  <  1  . 
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An  inner  function  GeH^iBiC^))  satisfying  (6.8)  is  called  purely  contractive 
[30,  p.  188].  For  p  =  1,  this  amounts  to  being  non-trivial. 

Recall  that  the  scattering  matrix  was  defined  up  to  left  and  right 
multiplication  by  constant  unitary  matrices  on  C^.  This  follows  from  the 
arbitrariness  in  choice  of  orthonormal  bases  in  the  forward  and  backward 
innovation  subspaces,  and  cannot  be  avoided.  We  make  the  following 

6.8  Definition  [30,  p.  132].  QeHJBCCP)) ,  and  G^HjBiC^))  are  said 
to  coincide  if  for  unitary  maps  ij ,  1 2  on  CP 

t2G1(z)  =  G(z)  |  z  |  <  1 


The  equivalence  relation  between  Markov  processes  is  obtained  by 

6.9  Definition.  We  say  that  the  Markov  processes  (U  X)  „,  {U  Xj  „ 

-  -oo  I  -OO 

are  equivalent  if  ,  Qx  coincide. 

It  readily  follows  from  [30,  Th.  VI.  2.3]  that  the  corresponding  Markov 
semigroups  are  unitarily  equivalent. 

While  the  correspondence 


X  —  (U,  H^,  D  +  ,  D_)x 

is  1-1,  the  same  does  not  hold  for  *  two  distinct  Markov  processes  may 
have  coinciding  scattering  matrices.  However,  as  we  shall  see  in  the 
next  section,  if  we  see  (as  we  do  in  our  context)  two  processes  y,  y^, 
as  equivalent  if  they  have  the  same  density  f  then  the  above  equivalence 

yy 

relation  is  the  right  one. 

Combining  with  Proposition  6.7,  we  obtain 


6.10  Theorem.  All  equivalence  classes  of  full  range  regular  Markov 
processes  are  parametrized  by  the  inner  functions  GeH^BCcP))  such 


that  I i 0 (0)  ||  <  1. 

We  next  obtain  a  dynamical  representation  for  a  Markov  process. 

6.11  Theorem.  Let  X  be  a  Markov  subspace.  Let  {x  (0),  ...,  x,  (0)} 
(k<oo)  be  a  complete  orthonormal  basis  in  X.  Then  for  x(n)  =  Unx(0) , 
x(0)  =  /xif°)  \  We  have 

(6.9)  x(n+l)  =  [A]x(n)  +  [B]w(n)  n  >  0 

where 

.00  t  + 

(i)  {  wCrOi^is  a  normalized  white  noise  vector 

process  of  dimension  p. 

(ii)  {w(n)  >n>Q  is  orthogonal  to  x(0)  . 


The  above  representation  determines  [A]  (the  matrix  representation  of 
the  state  operator  A)  and  [B]  up  to  unitary  equivalence. 


Proof*  Let  (U,  H  ,  D+,  D_)x  be  the  corresponding  scattering  system. 

^OO 

Write  D.  =)  .  ©  UnN  ,  N  the  backward  innovation  subspace  at  t  =  0 

.  .  .  /wi(°)  \ 

for  D+,  dim  N  =  p.  Let  the  entries  of  w(0)  =  (  ;rft./  be  an  orthonormal 

WpAUJ 

basis  in  N+  with  w(n)  =  Unw(0).  Since  UnN+MN+  (n  4  0)  (wtn)}^  is 
a  normalized  white  noise  process,  and  XAD+  implies  that  {w(n)>n>0  is 
orthogonal  to  x(0).  From  (6.5) 


UXC  X  ©  N  + 


+ 


Its  covariance  matrix  equals  6(n)I 


d  • 
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and  thus 


(6.10) 


Ux.(0)  =  PxUXj(0)  +  PN  Ux.(O)  =  Axj (0)  +  PN  Ux.(O) 


Let 


a..  =  (AXi(0),  x.(0))H  .  b..  =  (Ux.(0),  w.(0)>H  , 


and  define 


[A]  =  (ay)  ,  [B]  =  (b..)  . 


From  (6.10) 


x(l)  =  [A]x(0)  +  [B]  w(0)  , 

applying  U  to  both  sides  gives 

x(n+l)  =  [A]x(n)  +  [B]w(n) 

Since  the  above  representation  is  unique  up  to  a  choice  of  orthonormal 

bases  in  X ,  N  the  theorem  follows. 

+  <« 

The  notion  of  equivalence  between  Markov  processes  is  naturally  manifested 
in  their  dynamical  representations: 

6.12  Theorem.  If  (UnX}”oo,  {UnX1)^co  are  equivalent  Markov  processes 
(with  X  as  in  Theorem  6.11),  then  Xj  admits  representation  (6.9)  for 
some  Xj,  Wj  playing  the  role  of  x,  win  Theorem  6.11. 

Proof.  Let  (U,  H  ,  D,  ,  D  )v,  (U,  H  ,  D\  ,  D'  )v  be  the  corresponding 
y  +  -A  y+A. 


scattering  systems,  anci  the  incoming  spectral  representations 

It  readily  follows  that  for  the  unitary  operator 


(6.11) 


-  <Fx1>'lf'x 


we  have: 


(i)  L  [D+]  =  D'+  ,  L[  X]  =  Xx 

(ii)  LAx  =  AjLx  xX  ,  A  =  P^U|X,Aj—  Pj^^U|Xj 


The  result  follows  from  the  construction  in  Theorem  6.11. 


CHAPTER  SEVEN 


MARKOVIAN  REPRESENTATIONS  AND  FACTORIZATION 
OF  THE  SCATTERING  MATRIX 

A  regular  Markov  subspace  X  c  h  is  s&fd  to  be  a  representation 
for  the  process  y  if 

{yj(0),  ....  yp(0)}c  x  . 

Indeed,  in  this  case  we  can  write  (notation  as  in  Theorem  6.11) 
y(0)  =  [C]x(0) 

for  some  matrix  [C]  ,  and  applying  U  to  both  sides  gives 
(7.1)  y(n)  =  [C]x(n) 

a  dynamical  representation  for  the  regular  process  y  in  terms  of  the 
regular  (and  necessarily  full  range)  Markov  process  x.  We  first  show 
that  equivalent  Markov  processes  are  indistinguishable  provided  we  do 
not  distinguish  between  processes  y,  y^  having  the  same  spectral  density 
(and  see  them  as  equivalent). 

7.1  Theorem.  Let  X  c  H^  be  a  regular  Markovian  representation  for  y. 

If  {UnX^}_oo  is  equivalent  to  {UnX}”oo,  there  exists  a  process  yj  equivalent 
to  y  for  which  X^  is  a  (regular)  Markovian  representation,  and  for  which 


(7.1)  holds  (with  Xj  playing  the  role  of  y,  x). 

Proof.  Let  L  :  H  -*■  H v  be  as  in  (6.11).  By  the  properties  of  the 

j L  x. 

incoming  spectral  representation 

LU  =  UL  . 

Define  (componentwise) 

2^(0)  =  Ly(0)  ,  y^n)  =  Unyj(0) 

We  obtain 

<Z1(n) ,  Y1(°)>  =  <UnLy(0),  Ly(0)>  =  <LUny(0),  Ly(0)>  =  <y(n) ,  y(0)>  , 
and  therefore 


(*)  =  WX) 


a.e.  X 


Since  L[X]  =  Xj  ,  Xj  is  a  regular  Markovian  representation  for  yj  and  by 
Theorem  6.12  (7.1)  holds.  , 

By  the  last  theorem  and  Theorem  6.12  we  can  translate  the  question 
of  finding  all  regular  Markovian  representations  for  y  to  L2(CP).  For 

each  inner  function  0eH  (B(CP))  (as  in  Theorem  6.10)  we  consider  the 

00 

orthogonal  decomposition 

L2(CP)  =  Q*H2(CP)  ©  ((H2(CP)  ©  0*H~(CP))  ©  H2(CP) 


+ 


Note  that  the  converse  is  trivial,  see  Theorem  6.12. 


inducing  the  L-P  scattering  system 


(X,  L 2(CP)  ,  H2(Cp)  ,  e*H2(Cp)]  , 

+ 

corresponding  to  the  full  range  regular  Markov  subspace  (with  respect 
to  X) 


X  =  H2(Cp)  0  0*H2(Cp) 

Now  recall  that  for  the  incoming  spectral  representation  F_  (for 
the  scattering  system  associated  with  the  y  process) 

F_y(  0)  =  X  A 

Thus,  finding  all  regular  Markovian  representations  for  y  reduces  to 
the  following: 

D  ++ 

Find  all  inner  functions  G^eH^QKC*))  such  that 

(7.2)  xagh2  (cp)  O  e*  H~(CP)  . 

7.2  Theorem.  All  regular  Markovian  representations  of  y  are  parametrized 
by  those  and  only  those  inner  functions  3  for  which 


The  corresponding  decomposition  in  which  X=  H2(CP)  0  0H2(CP) 
(the  model  space,  cf.  [37])  will  not  be  used  here  since  we  wish  Hg  to  play 
a  central  role. 

++  Since  A  is  conjugate  outer  the  corresponding  Markov  subspace  will 
automatically  be  of  full  range. 
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(7.3)  s  =  e*e2 


02fHoo(B(Cp)) 


Proof.  (7.3)  holds  iff  6^A«H2(CP)  iff  G16TeH2(CP).  Since  r  is  outer 
the  latter  holds  iff  O^sH^C15).  Since  Q^e  LJiB(Cp))  the  result  follows.^ 
The  possibility  of  writing  the  scattering  matrix  S  in  the  form  (7.3) 
has  an  interpretation  on  a  process  level.  By  the  Beurling-Lax  theorem, 

(7.3)  holds  iff  (the  invariant  subspace  for  the  left  shift): 

(7.4)  H2(Cp)  ©  (range  H^)  is  of  full  range  (for  >.). 

By  Lemma  4.2  this  is  equivalent  to 


(7.5) 


H  (0)  0 


Pu-  H  (0)  is  of  full  range  (for  U). 

I  1 


An  LJB(CP))  function  satisfying  (7.4)  is  called  strictly  non-cyclic  ([12], 
[15]).  The  process  y  satisfying  (7.5)  (i.e.  ,  having  a  strictly  non-cyclic 
scattering  matrix)  is  called  strictly  non-cyclic  [29]. 


Recall  that  two  inner  functions  U  ,V  €  Hro(B(  CP) )  are  left  coprime  if  for 
no  non-trivial  [i.e.,  for  no  unitary  matrix  in  B(CP)]  inner  function  W 
we  have 


U  =  WUj  ,  Vj  =  WVj 


with  Uj,  Vj  inner.  Let 


£>2  is  necessarily  inner  since  S  is  unitary  valued  on  T. 

++  Por  the  p  =  1  dimensional  case  the  equivalence  of  (7.4)  and 

(7.5)  is  due  to  Levinson-McKean  [28]. 
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S  =  Qf  Q2 

be  the  (left)  coprime  factorization  for  S.  For  the  inner  function  Qj  we 
have  by  the  Beurling-Lax  theorem 


range  Hg  =  H~(CP)  ©  Q*  H~2  (Cp)  . 

If  S  =  U*U2  *s  some  other  factorization,  then  we  necessarily  have  for 
some  (non-trivial)  inner  function  W 


Uj  =  WQl  ,  U2  =  WQ2  . 


Now  observe  that  for  inner  functions  U, ,  Q,  in  H  (£(CP)) 

11  CO 


U*H2(CP)  c  q*h2(cp) 


iff 

QjU*  H2(CP)  c  h2(cp)  , 
i.e.  ,  iff  for  some  inner  function  W 


(7.6)  Uj  =  WQj  . 


It  follows  that  for  the  (regular  full  range)  Markov  subspaces 


X  =  H2(cP)  ©,  Q*H2(CP),  Xy  =  H2(Cp)  ©  U*H2(CP) 


we  have 


Thus,  XQ  is  the  smallest  (setwise)  regular  Markov  subspace 


representing  y.  Let  us  call  a  regular  Markov  subspace  XCH  representing 

JL 

y  minimal  if  no  proper  subspace  of  X  +  is  a  Markovian  representation  of 


v.  Note  that  for  the  minimal  subspace  X_  we  have  the  orthogonal 


decomposition 


L2(CP)  =  Q*H2(CP)  ©  (H‘(CP)  ©  Q*H2(CP))©  H2(CP) 


-1 


which  under  F_  goes  to 


(7.7)  HJ,=(H'(0)  ©  PH+(Q)  )©  Ph-(0)H*(0)  ©  H*(0)©  H‘(0)  . 

Y  Y. 

We  established  the  following: 

7.3  Theorem.  Let  y  be  strictly  non-cyclic,  and 

S  =  Qj  Q2 


the  (left)  coprime  factorization  of  S.  Then  the  equivalence  class  of  all 
regular  Markovian  representations  of  y  (or  an  equivalent  process  to  it) 
which  corresponds  to  is  minimal.  This  class  is  represented  in 
L2(Cp)  by  the  subspace 

XQi  =  H“(CP)  0  Q*  H-  (Cp)  =  range  Hs  . 


+ 


Which  is  necessarily  regular. 
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For  each  such  Markov  subspace,  there  exists  a  process  t,  equivalent  to 
y  for  which  (7,1)  holds. 

In  a  similar  fashion,  one  obtains  that  all  factorizations 


s  =  U2U* 


parametrize  via  the  inner  function  all  regular  Markov  subspaces  XCH 
representing  y.  The  minimal  one  is  obtained  by  the  (right)  factorization 
S  =  P2Pi*  and  *ts  rePresentati°n  w  L2(C'P)  is  )(p  =  H2(CP)©P1H2(CP) 

(see  footnote  on  page  60  ). 

A  number  of  observations  are  in  order.  First,  note  that  the 
general  degree  theory  for  strictly  non-cyclic  functions  [15,  Ch.  III.  5] 
arises  naturally  in  our  context.  All  regular  Markovian  subspaces  XCH 
representing  y  which  are  in  the  equivalent  class  parametrized  by  the 
(purely  contractive)  inner  function  Q'H  (B(CP))  will  be  of  degree 


(7.8)  d(Q )  =  det  Q  , 


an  inner  function  in  Hoo(its  scalar  multiple,  see  Nagy-Foia^  [30,  p.  216]). 
From  (7.6)  we  obtain 


d(Uj)  =  d(Q1)d(W) 


and  thus  the  degree  of  the  minimal  subspace  is  the  lowest  in  the  sense 
that  d(Q)  is  the  weakest  (i.e.  ,  it  is  an  inner  divisor)  among  the  degrees 
of  all  other  regular  Markovian  subspaces  representing  y.  Second,  we 


With  £  playing  the  role  of  y,  and  Hr  =  H  . 


observe  that  by  inspection  [see  (7.7)],  all  minimal  Markovian  subspaces 

+ 

representing  y  are  observable  and  constructive  and  conversely ,  all 
observable  and  constructive  regular  Markov  subspaces  representing  y 
are  minimal.  Moreover,  we  have  exact  observability  and  constructibility 
iff  range  is  closed. 


_ +  The  regular  Markov  subspace  X  is  said  to  be  observable  if 

PYH^  (0)  =  X,  and  constructible  if  PVH  (0)  =  X.  It  is  said  to  be 
A  1  A  I  + 

exactly  observable  or  constructible  if  PYH  (0)  =  X,  PVH  (0)  =  X 

Ay  a  y 

respectively.  See  [37]. 


CHAPTER  EIGHT 


MODEL  REDUCTION 

The  question  of  model  reduction  for  stationary  processes  is 
distinct  in  nature  from  the  corresponding  one  in  systems  theory.  In 
the  latter ,  model  reduction  has  a  natural  formulation  in  terms  of  inputs 
and  outputs.  Thus,  the  system  is  represented  by  a  black  box: 


input 


output 


whose  output  \^el2 (~ °°.  Cq)  when  inputed  by  uel^-00,  Cp)  is  given  by 
Y  =  Lu 

where  L  :  l2(-°°,  ls  a  1301111(16(1  c&usal  [L12(0,  °°;CP)C 

12 C 0 »  (f1)]  Laurent  operator  (input-output  map)  with  symbol 

TeHjiB(C^ ,  C^))  (frequency  response  function).  The  (minimal)  state 
space  of  the  system,  i.e.  ,  the  lowest  dimension  of  x  in  the  dynamical 
representation  + 

x(n+l)  =  [A]x(n)  +  [B]u(n) 

(8.1) 

ip(n)  =  [C]x(n) 


+  We  assume  T(0)  =  [0]. 


equals  rank  HT  =  k.  One  way  to  formulate  a  model  reduction  problem 

in  this  context  is  the  following:  Given  T  find  a  (system)  function 

T  e  H  (J3( C- ,  C  ))  (with  corresponding  Laurent  operator  L  )  with  rank 
m  00  in 

rank  H^;  5  m  <  k ,  such  that  if  the  two  systems  were  inputed  by  the 
m 

same  u_el2(-oo,  -1;  CP)  up  to  time  t  =  0: 


u 


for  the  corresponding  outputs  from  time  t  =  0  onward  :  ^+,  ^+el2(0 ,  °°;Cq) , 
we  have 


II V  v+||j  =  min. 

2 

over  all  u_e  1 (— ,  -1;CP).  ||u_!!  <  1  (normalization  condition) .  Taking 

Fourier  transforms,  we  readily  obtain 

min  sup{|j(HT-HT  )$j|L  :  4>eH'(CP),  ||<J>liL  5  1) 

•j>  m2  2 

m 

where  the  min.  is  taken  over  all  T  ®H  (J3(CP,C^))  such  that  rank 

in  00 

H~  5  m.  Thus,  we  have 
m 


H^,  being  the  Hankel  operator  from  H2(CP)— •  H2(C^) 


+ 


i'-’  -!■  -1T».» '•'■  .T-  ■ j  ■■  r*  ■  i  ■■.  i  .  i  >'1..  i  ■  -f  r>.T'n 
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min 

A 

T 


m 


a  classical  Hankel  approximation  problem  [7], 

This  input-output  notion  is  not  adequate  in  the  context  of  stationary 
sequences.  While  a  realization 


x(n+l)  =  [A]x(n)  +  [B]w(n) 

(8.2)  n  >  0 

y(n)  =  [C] x(n) 


resembles  the  one  in  systems  theory,  one  should  keep  in  mind  that  the 

Co 

notion  of  inputs  is  absent,  and  the  sequence  {y(n)}oo(or  a  sequence 
equivalent  to  it)  plays  a  distinct  role.  One  approach  close  in  nature  to 
the  above  model  reduction  formulation  is  to  approximate  this  sequence. 

We  wish,  however,  to  emphasize  that  while  we  formulate  the  approximation 
problem  geometrically  in  the  space  H^,  the  object  we  seek  is  the 
approximating  spectral  density.  Thus,  all  approximating  sequences 
which  follow  will  'live'  in  H  and  will  be ,  with  respect  to  the  unitary 

jcL 

operator  U,  on  Hy.  The  resulting  processes  will  be,  therefore, 
stationary  correlated  with  y.  Observe  that  approximating  the  sequence 
(yCrDUor  an  equivalent  sequence  to  it  are  thus  indistinguishable. 


Problem  A.  Let  {^(n)}”^  be  the  given  process  with  (minimal) 
representation  (8.2),  dim  x  =  rank  =  k  <,  °°.  Find  a  regular  process 
(y(n)  }  of  dimension  p  such  that  for  its  corresponding  (minimal) 


representation 


V. 


x(n+l)  =  [  A] x(n)  +  [B]w(n) 
X(n)  =  [C]x(n) 


we  have 


(i)  dim  x  ^  m  <  k  , 
and ,  moreover 

(ii)  sup{||y,(n)  -  y.(n) | j  :  j=l,  ....  p,  nez}  =  min. 

3  3  H 

I 

A 

Assume  first  p=l.  Let  (yfn)}^  be  a  solution  and  X  one  of  its  minimal 
regular  Markovian  representations  (dim  X*m).  Define 


y(0)  =  P£y(0) 


By  the  orthogonality  principle,  we  obtain 


~2(0)  =  ||y(0)  -  y(0)  ||2  >  ||y(0)  -  y(0)  ||2=  £(0)5 


Clearly  for  the  process 


{Uny(0)  :  neZ) 


the  (minimal)  state  space  is  of  dimension  not  exceeding  m.  By 


stationarity 


e(n)  =  ||y(n)  -  y(n)  ||  =!|Uny(0)  -  Uny(0)  ||  =  e(0) 


e(0)  =  sup  { ||y(n)  -  y(n) |r  :  neZ}  *  e(°> 


We  readily  conclude 


*****  f .  * 


>  \*; ..  .. ; ..  /•%*■ 

*.  *\*\  ■»*«**.  < 

••  •>  vN  .  v  *  «  V.  '> .  - 


/  / 
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y  (0)  =  P£y(0) 


As  for  arbitrary  p,  we  note  that  the  above  procedure  extends  componentwise 

A 

for  each  y^  j=l,  ... ,  p,  and  thus  for  the  process  y  we  need  to  have 


Yj(0)  =  P£y.(0) 


j— 1 ,  . . .  ,  p 


Our  problem  thus  reduces  (in  the  space  H  )  to  the  following: 

y 


Find  a  regular  Markov  subspace  XcHy  such  that: 


(8.3) 


(i)  dim  X  £  m 


(ii)  max  dist^  (y.(0),  X)  =  min. 
j=l,  ...  ,  p  y  J 


Observe  that  if  e  is  the  minimum  above 


(C  )  -  (C  ) 

i,j  i.j 


(y^n),  y.(0) )  -  ( yi(n) ,  y- (0) )  |  < 


<  |((yi(n) ,  y.(0)  -  y.(0))|  +  [((y^n)  -  y.(n) ,  y^ (0))[  <  2eoyy 


Thus  for  all  n 


II  c„  -  C  II  <  f(2e)2£  Oyy  l1  *  2^P  [X  °y.y.l*  • 

n  n  L  i  ,j=l  yiyiJ  i-i=l  yiyiJ 


Since  y  is  regular  and  of  full  rank  p,  it  follows  from  [30,  Prop. 

A 

1.2.1]  that  X  is  of  full  range  and,  therefore,  belonging  to  the  equivalent 
class  of  regular  full  range  Markov  subspaces  parametrized  by  the 
(purely  contractive)  inner  functions  0eHoo(B(C^>))  for  which  d(0)  [see 


(7.8)]  is  a  Blaschke  product  of  degree  ^  m.  From  Chapter  7,  (8.3) 
translates  in  L2(CP)  into  finding  a  Q  for  which 

max  dist  „  (X  A.,  h:(Cp)GQ*HI  (Cp))  =  min.  ,  A  =  [A.j...|A  l 
j=l. - P  L^(CP)  3  2  2  1  P 

8.1  Proposition .  Let  0fHec(B(Cp))  be  an  inner  function  and  geH2(Cp). 
Then 

P  ^  g  =  0*7T  Gg. 

H'(Cp)0  0*H~(CP) 

Proof.  One  readily  checks  that  O*’n+0geH2(Cp).  Since 

g-  0*TT+0g  =  Q*[Qg-  7T+0g]  =  0*[(IL  -  TT+)0g]  =  0*  TT_  g£H~(CP)  , 

2 

the  result  follows. 

•4 

Now  observe  that 

XA  -  0*tt+  0  X  A  =  G*[0  X  A  -  TT+0  XA]  =  0*tt_(GXA)  =  0*H0sr  . 

Problem  A  therefore  admits  the  following  equivalent  formulation: 

Problem  A~.  Find  an  inner  function  QcHoo(B(CP))  such  that  ||Q(0)||  <  1 
and  the  following  holds: 

+ 

(i)  d(Q)  $  m  , 

(8.4) 

(ii)  max  ||HQ<-,r .  |j  =  min.  r  =[r.|...jr  ]  . 

i=l,...,p  J  L0  1  p 


+  i.e.,  the  Blaschke  product  is  of  degree  <  m. 


72 


We  first  establish  the  existence  of  Q  solving  (8.4)  for  the  case  p  =  1. 

8.2.  Theorem.  There  exists  an  inner  function  q  +  of  degree  <m 
such  that 


(8.5)  !!Hqs*rl!  =  ^!!Husri!  :  u  inner,  degree  u  s  m} 


Proof.  Denote  by  6  the  infimum  of  the  right  hand  side  in  (8.5)  and  let 
^un^*be  a  sequence  of  Blaschke  products  such  that  |[Hu  r  ||  -*  6.  The 
family  uQ,  Uj,  ...  is  uniformly  bounded  in  |z|  <  1,  in  particular,  on 
compact  subsets  of  |z|  <  1.  It  follows  (e.g.,  [39,  Th.  14.6])  that  there 
exists  a  subsequence  (u^  }  converging  uniformly  on  compact  subsets 

j  j=o 

of  |  z  1  <  1  to  a  function  qeHM  which  is  ?  0.  By  a  theorem  of  Rouche 

[39,  p.  242]  ,  it  readily  follows  that  if  ja|  <  1  is  a  zero  of  q,  then  there 

exists  a  sequence  {a  i  in  |z|  <  1  such  that  lim  a  =  a  and  for  some  N 

j  j*0  .  n; 

]  00 


Since  each  u^  has  at  most  m  zeros  in  |  z  |  <  1 ,  q  has  at  most  m  zeros 

in  | z  J  <1.  Let  0  <  <  1  be  such  that  q  has  no  zeros  in  r^  <  jzj  <1. 

Applying  once  more  Rouche’s  theorem,  we  conclude  that  for  some  integer 

N0.  aU  un  »  i  *  Nq  have  the  same  number  of  zeros  in  j  z  [  <  1  as  q , 

i.e.  ,  none.  This  implies  that  the  family  {u  }  is  holomorphic  in  a 
.  .  ,  ,  "j  j^N0 

region  containing  |z|  <  1.  We  readily  conclude  that  q  is  an  inner  function 

having  at  most  in  zeros  in  jzj  <  I  and  none  on  jz|  =  1,  i.e.,  a  Blaschke 

product  of  degree  <  m. 


+ 


Clearly  q  is  non-trivial. 
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As  for  general  p ,  by  a  normal  family  argument  one  obtains  a 
subsequence  ©n  converging  uniformly  over  compact  subsets  of  |z|  <  1 
[uniform  topology  in  B(C^)]  to  an  inner  function  Q.  Applying  the 
argument  of  the  above  proof  to  the  inner  functions  d(0n  )  which  converge 
(uniformly  over  compact  subsets  of  JzJ  <  1)  to  the  inner  function  d(Q), 
we  conclude  that  Q  is  of  degree  d(Q)  (a  Blaschke  product  of  degree)  <,  m. 
To  show  that  it  is  purely  contractive  ,  we  note  that  if  for  some  0  ^  ace** 
||Q(0)a||  =  | [ a j |  then  Q(z)aeH2(CP)  is  a  constant  function  and  thus, 
considering  ||Q(z)a||eH2  we  obtain  ||0  (z)a||  -*•  ||Q(z)a||  uniformly  over 

compact  subsets.  By  Rouch^s  theorem,  this  is  impossible  since  the  Gn 
are  purely  contractive.  We  conclude  that  Q  is  purely  contractive. 

The  solution  obtained  is  not  necessarily  unique,  and,  moreover,  it 
is  not  obtained  in  a  constructive  fashion.  A  weaker  version  (which  is 
nonetheless  no  less  interesting)  is  obtained  by  trying  to  minimize  ||H^S|| 
over  all  admissible  0.  For  the  p  =  1  dimensional  case,  a  constructive 
solution  can  be  obtained  by  employing  the  analytic  properties  of  Schmidt 
pairs  for  a  Hankel  operator  [7].  Thus,  since 

llHusrll  S  i|Hus  II  I|r|| 


we  have 

min{||H  r  ||  :  u  inner,  degree  u<m}  s  min{||H  j|  :  inner,  degree  usm}*  ||r  || 

Uo  us 

and  we  consider 

+ 

Problem  B.  Find  an  inner  function  q  such  that 


+  This  inner  function  q  will  be  found  to  be  non-trivial  (see  Theorem 

6.10). 
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(8.6) 


(i)  degree  q  ^  m 

(ii)  l!HqS  tl  =  min  • 


We  make  the  assumption  (which  we  later  characterize  on  a  process  level) 
that  Hg  is  a  compact  operator.  Let 

1  =  s0(Hs)  >  si(Hs)  > 

be  the  enumeration  of  its  non-zero  singular  numbers  (s-number) 
repeated  with  multiplicity  (which  is  finite  by  the  compactness).  These 
are  the  positive  square  root  of  the  eigenvalues  of 

[Hs]  *[HS3  • 

If  H  is  of  finite  rank  k  <  ”,  0  is  an  s~number  with  infinite  multiplicity 

o 

and  the  enumeration  will  be 


l=s0(Hs)>  ...  4  sk_i(Hs)>sk(Hs)  =sm(Hs)  =  ...  =  0. 
8 . 3  Theorem.  Let 

...  >  Vl(H  )  >  sm(H  >  1  ...  , 

+ 

then  there  exists  a  unique  q  of  exact  degree  m  satisfying  (8.6). 

Proof.  Denote  by  Bm  the  family  of  all  Blaschke  products  of  degree  £  m. 
By  a  result  of  Kronecker  (cf.  [35])  for  ^cL^  rank  H  s  m  iff  <J>  =  uh 

+  Up  to  coincidence. 


i 
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with  ueBm,  htH^.  Combining  with  Nehari's  theorem,  we  obtain 


inf  ||H  ||  =  inf  inf  |Jus  -  h |L  =  inf  inf  ||us-h-ug||T  = 

ueB  us  ueB  heH  “ucBm  g.heH  ~ 

m  m  °°  m  00 


=  inf  inf  ||(s  -  uh)  -  g||  =  inf { jjH  -  H  j|  :  rank  H  <  m}  =  s  (H  )  . 

geH  ueB  Loo  s  ^  V  ms 

&  00  m 

heH 

00 
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Let  q  be  the  inner  function  of  degree  m  such  that 
range  =  Hj  0  q  Hj  . 

It  follows  that  q  is  an  inner  divisor  of  u^  and  u2»  Since  u^,  u2  are  of 
degree  £m  we  conclude  that  Uj,  u2,  q  coincide  which  completes  the 
proof.  , 

8.4  Remark.  Note  that  for  m  =  k  =  rank  Hg  the  inner  function  q 
coincides  with  the  inner  function  q^  in  the  coprime  factorization  s  =  q^2 
(see  Theorem  7.3). 

This  q  is  induced  by  the  unique  rank  m  Hankel  approximant  H  via 

Ym 

X  =  H'  0  q  H‘  =  range  H  . 
n  Ym 

On  this  Markov  subspace  in  L2  (with  respect  to  X),  we  project 

F_y(0)  =  v  r,  and  thereby  obtain  the  approximating  process  y  induced  by 

q.  Indeed,  in  the  space  Hy  we  have 

Hy  D  Xq  -  F- 1[Xq'  • 

y(0)  is  obtained  by 

y(0)  =  Px  y(0) 

xq 

and  we  set  y(n)  =  Uny(0) 

Incorporating  [7] ,  we  thus  have  the  following  procedure  of 
constructing  the  approximating  process  y  with  its  density  f'~  which  are 


induced  by  q: 


Step  1.  Let  s  ,(H  )  >  sm(H  )  have  multiplicity  y,  and 
- *- —  m-l  sms 

C2*  •  ’  •  *  £ye  C)  be  the  corresponding  eigenvectors,  i. 

‘H.J'IH.Hj  =4(H.Kf  i  =  i.  2 . “ 

Let 

oo 

f,(z)  =  2  C.(n)zn€H9  3  =  1,2 . V 

J  n=0  J  J 

Factor 

f.(Z)  =  U.(z)  l^.(z) 

where  u^  is  inner,  ^  outer.  The  desired  function  q  is 
q  =  Uj  A  u2  A  ...  Au  , 

i.e.,  the  greatest  common  inner  divisor  for  the  u.  [7,  Th.  1.2]. 
Proposition  8.1  the  projection  of  xF  onto  X^  is  given  by 

q  tt+  q“  r  . 

Step  2.  Factor 

qTT+qxr  = 

where  <J>.  is  inner,  ipg  outer.  Observe  that 


Cn  =  (y(n)’  y(0))H  =  (F-y<n)*  F-y(0))L2  =  (X x<i)i ^  ’  x<J>iV  = 

=  (X\.  V  • 

Thus  is  an  outer  factor  for 

fyy(x)  -wl*i(eV  ' 

the  density  of  the  y  process. 

+ 

We  next  obtain  a  bound  on  the  normalized  difference 

~  cn 
C  0 

between  the  moments  cfi  =  (y(n),  y(0)),  cn  =  Cy(n) ,  y(0))  of  the 
corresponding  y,  y  processes. 

8.5  Proposition.  We  have 

c  -  c 

(8.7)  -  —  — n  <  2s  (H)  ,  n  =  0 ,  ±1 . 

cQ  m  & 

Proof.  From  the  discussion  preceding  problem  A~  and  the  proof  of 
Theorem  8.3,  we  obtain 

||y(n)  -  y(n)  |L  =  ||y(0)-y(0)  ||  =  l|Hqsr  |jL^< 

^  UHqs||  ||r  ||  =  sm(Hs)/57  . 

2 

+  To  the  variance  a  =  cn  of  the  y  process. 
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Thus , 


Vcn 

(y(n)  ,y (0))  -  (y(n)  ,y(0>) 

c0 

c0 

|(y(n),  y (0)  -  y (0) )  +  fy(n)  -  y(n) ,  y(0))| 


*  Sm (H„)  +  sm(HJ  — y - "  <  2s  (H„) 


m'  s  m'  s' 


'0 


m '  s 


We  make  a  number  of  observations.  First,  observe  that 
l|y(0)  -  y(0)||H  =  ||r  -  4>j^ellL2  . 

Thus,  can  not  be  seen  as  a  rational  approximation  in  L 2  norm  to  the 
outer  factor  T  of  f  .  If  rank  Hg  is  infinite  or  otherwise  if  the  singular 
values  drop  sharply,  (8.7)  demonstrates  that  the  process  y  is  a  rather 
good  approximate  to  y.  Second,  we  note  that  the  assumption  about 
the  compactness  of  Hs  is  not  a  necessary  one.  Indeed,  the  spectrum  of 
a  bounded  Hankel  operator  is  the  union  of  its  point  spectrum  and  its 
essential  spectrum  [34].  The  analytic  properties  of  Schmidt  pairs  were 
given  in  [7]  for  a  general  bounded  Hankel  operator  and,  thus,  our 
construction  carries  over  to  this  case  mutatis-mutandis.  The  multiplicity 
y  of  sm(Hg),  however,  may  be  infinite  in  this  case. 

We  now  characterize  the  compactness  condition  on  Hg  on  a  process 
level.  For  any  two  subspaces  M,  N  of  a  Hilbert  space  H  let 

p(M,N)  =  sup{j(f,g)H  j  :  f  (M,  geN ,  J|fJ!,||g||  4  1}. 

It  readily  follows  that  p  (M,N)  =  |  J  | .  If  p<  1,  M  ,N  are  said  to  be 


at  positive  angle  [22].  A  process  y  is  said  to  be  strongly  mixing 
(Yaglom  [43])  if 

p(H"  (0).  H*(n))  +  0  (n+~)  , 

3L 

i.e. ,  its  distant  past  and  future  are  nearly  orthogonal.  It  is  known 
(see  e.g.  [22])  that  a  regular  process  is  strongly  mixing  iff  the  operator 
P_P+  is  compact. 

Combining  with  Lemma  4.2  we  conclude  by  using  [32] 

8.6  Proposition,  y  is  strongly  mixing  iff  H„  is  compact,  i.e., 

Se  H  (B(CP))  +  C  (B(  CP ) ) .  + 

CO 

Processes  with  rational  density  are  thus  strongly  mixing.  Helson- 
Sarason  [22]  characterized  the  spectral  density  of  (1  dimensional) 
processes  which  are  strongly  mixing 

fyy  =  |P|2  exp  (u  +  v) 

where  u,v  are  continuous  on  T  ,  v  the  Harmonic  conjugate  of  v,  and  P 
is  a  polynomial  with  roots  on  T.  In  particular,  densities  which  are 
continuous  and  strictly  positive  are  in  this  class. 

8.7  Remark.  As  for  the  vector  generalization  of  the  above  construction, 
we  remark  that  if  H  is  the  best  rank  m  Hankel  approximant  to  Hs, 

+  C(B(C^))  denotes  the  continuous,  B(CP )  valued  functions  f  on 
the  circle. 


the  solution  can  be  carried  over  provided  we  guarantee  that  is 
strictly  noncyclic,  i.e.  , 


<f>  =  0*P 

m 


GeH^BCC15))  inner,  PeH^dKC*5)) ,  and,  moreover,  1 1 © (0)  | {  <  1.  ' 


did  not  find  a  way  to  guarantee  it. 
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